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TITLE OF THE INVENTION 

COORDINATE U± VIVO GENE EXPRESSION 

CROSS-RELATED TO OTHER APPLICATIONS 

This is a continuation-in-part of U. S. Serial Number 
08/207,526, filed March 7, 1994, now pending. 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

A method for coordinate expression in a single cell, in 
vivo, of exogenous genes via introduction into the tissue of a vertebrate 
of polycistronic polynucleotide constructs is described. The method 
results in production of immune responses against the products 
produced as a result of expression of the exogenous genes. The method 
and polynucleotide constructs of this invention may be used in a 
vertebrate to generate immune responses against antigenic epitopes 
expressed by a single cell. The coordinate expression results in 
improved expression of gene products which may be otherwise poorly 
expressed. It also results in improved cellular immune responses due to 
provision of T-cell stimulatory signals by the same cell expressing T- 
cell antigens. Polynucleotide constructs encoding human 
immunodeficiency virus (HIV) antigens exemplify one embodiment of 
the method. 

2. Background of the Invention 

A major challenge to the development of vaccines against 
viruses, particularly viruses with a high rate of mutation such as HIV, 
against which elicitation of neutralizing and protective immune 
responses is desirable, is the diversity of the viral envelope proteins 
among different viral isolates or strains. Because cytotoxic T- 
lymphocytes (CTLs) in both mice and humans are capable of 
recognizing epitopes derived from conserved internal viral proteins and 
may be important in the immune response against viruses, efforts have 
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been directed towards the development of CTL vaccines that elicit 
heterologous protection against different viral strains. 

CD8+ CTLs kill virally-infected cells when their T cell 
receptors recognize viral peptides associated with MHC class I 
molecules. These peptides are derived from endogenously synthesized 
viral proteins. Thus, by recognition of epitopes from conserved viral 
proteins, CTLs may provide cross-strain protection. Peptides capable 
of associating with MHC class I for CTL recognition originate from 
proteins that are present in or pass through the cytoplasm or 
endoplasmic reticulum. Exogenous proteins which enter the endosomal 
processing pathway (as in the case of antigens presented by MHC class II 
molecules) are not usually effective in generating CD8+ CTL responses. 

Efforts to generate CTL responses have used replicating 
vectors to produce the protein antigen within the cell or have introduced 
peptides into the cytosol. These approaches have limitations that may 
limit their utility as vaccines. Retroviral vectors have restrictions on 
the size and structure of polypeptides that can be expressed as fusion 
proteins while maintaining the ability of the recombinant virus to 
replicate. Further, the effectiveness of vectors such as vaccinia for 
subsequent immunizations may be compromised by immune responses 
against the vectors themselves. Also, viral vectors and modified 
pathogens have inherent risks that may hinder their use in humans [R.R. 
Redfield et al, New Engl. J. Med. 316, 673 (1987); L. Mascola et al.. 
Arch. Intern. Med. 149, 1569 (1989)]. Furthermore, the selection of 
peptide epitopes to be presented is dependent upon the structure of an 
individual's MHC antigens; thus, peptide vaccines may have limited 
effectiveness due to the diversity of MHC haplotypes in outbred 
populations. 

Benvenisty, N., and Reshef, L. [PNAS 83, 9551-9555, 
(1986)] showed that CaCl2-precipitated DNA introduced into mice 
intraperitoneal ly (i.p.), intravenously (i.v.) or intramuscularly (i.m.) 
could be expressed. Intramuscular injection of DNA expression vectors 
in mice results in the uptake of DNA by the muscle cells and expression 
of the protein encoded by the DNA [J. A. Wolff et al.. Science 247, 
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1465 (1900); G. Ascadi et a!,, Nature 352, 815 (1991 )J. The plasmids 
were maintained episomally and did not replicate. Subsequently, 
persistent expression has been observed after i.m. injection in skeletal 
muscle of rats, fish and primates, and cardiac muscle of rats. The 
technique of using nucleic acids as therapeutic agents was reported in 
WO90/I 1092 (4 October 1990), in which naked polynucleotides were 
used to vaccinate vertebrates. 

It is not necessary for the success of the method that 
immunization be intramuscular. Thus, Tang et al., [Nature, 356, 152- 
154 (1992)] disclosed that introduction of gold microprojectiles coated 
with DNA encoding bovine growth hormone (BGH) into the skin of 
mice resulted in production of anti-BGH antibodies in the mice. Furth 
et al., [Anal. Biochem. 205, 365-368, (1992)] showed that a jet injector 
could be used to transfect skin, muscle, fat, and mammary tissues of 
living animals. Methods for introducing nucleic acids was recently 
reviewed by Friedman, T., [Science, 244, 1275-1281 (1989)]. 
Robinson et aL, [Abstracts of Papers Presented at the 1992 meeting on 
Modem Approaches to New Vaccines, Including Prevention of AIDS, 
Cold Spring Harbor, p92] reported that i.m., i.p., and i.v. 
administration of avian influenza DNA into chickens provided 
protection against lethal challenge. However, Robinson et al. did not 
disclose which avian influenza virus genes were used. In addition, only 
H7 specific immune responses were alleged; the induction of cross- 
strain protection was not discussed. Intravenous injection of a 
DNAxationic liposome complex in mice was shown by Zhu et al., 
[Science 2^1:209-21 1 (9 July 1993); see also WO93/24640, 9 Dec. 1993] 
to result in systemic expression of a cloned transgene. Recently, Ulmer 
et al., [Science 259:1745-1749, (1993)] reported on the heterologous 
protection against influenza virus infection by injection of DNA 
encoding influenza virus proteins. 

The need for specific therapeutic and prophylactic agents 
capable of eliciting desired immune responses against pathogens and 
tumor antigens is achieved by the instant invention. Of particular 
importance in this therapeutic approach is the ability to induce T-cell 
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immune responses which can prevent infections or disease caused by 
virus strains which are heterologous to the strain from which the 
antigen gene was obtained. This is of significance with HIV, since HIV 
mutates rapidly, and because many virulent isolates have been identified 
[see. for example, LaRosa et al.. Science 249:932-935 (1990), 
identifying 245 separate HIV isolates]. 

In response to this diversity, researchers have attempted to 
generate CTLs by peptide immunization. Thus, Takahashi et al., 
[Science 255:333-336 (1992)] reported on the induction of broadly 
cross-reactive cytotoxic T cells recognizing an HIV envelope (gpl60) 
determinant. They recognized the difficulty in achieving a truly cross- 
reactive CTL response and suggested that there is a dichotomy between 
the priming or restimulation of T cells, which is very stringent, and the 
elicitation of effector function, including cytotoxicity, from already 
stimulated CTLs. 

Wang et al., fP.N.A.S. USA 90:4156-4160 (May, 1993)] 
reported on elicitation of immune responses in mice against HIV by 
intramuscular inoculation with a cloned, genomic (unspliced) HIV 
gene. The level of immune response achieved was low, and the system 
utilized portions of the mouse mammary tumor virus (MMTV) long 
terminal repeat (LTR) promoter and portions of the simian virus 40 
(SV40) promoter and terminator. SV40 is known to transform ceils, 
possibly through integration into host cellular DNA. Therefore, unlike 
the system described herein, the system described by Wang et al. may be 
inappropriate for administration to humans. In addition, the DNA 
construct of Wang et al. contains an essentially genomic piece of HIV 
encoding contiguous Tat/REV-gp 1 60-Tat/REV coding sequences 
(Figure 1). As is described in detail below, this is a suboptimal system 
for obtaining high-level expression of the gpl60. One drawback is that 
the expression of Tat has been recognized to play a contributory role in 
the progression of Kaposi's Sarcoma, [Y.N. Vaishav and F.W. Wong- 
Staal, An. Rev. Biochem. (1991)]. 

WO 93/17706 describes a method for vaccinating an animal 
against a virus, wherein carrier particles were coated with a gene 
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construct and the coated particles are accelerated into cells of an animal. 
In regard to HIV, essentially the entire genome, minus the long terminal 
repeats, was proposed to be used. That method may represent a 
substantial risk for recipients. Constructs of HIV should, in general, 
contain less than about 50% of the HIV genome to ensure safety of the 
vaccine. Thus, a number of problems remain if a useful human HIV 
vaccine is to emerge from the gene-delivery technology. 

The instant invention uses known methods for introducing 
polynucleotides into living tissue to induce expression of proteins. This 
invention provides a immunogen for introducing HIV and other 
proteins into the antigen processing pathway to efficiently generate 
HIV-specific CTLs and antibodies. The pharmaceutical is effective as a 
vaccine to induce both cellular and humoral anti-HIV and HIV 
neutralizing immune responses. The instant invention addresses some of 
the problems by providing polynucleotide immunogens which, when 
introduced into an animal, direct the efficient expression of HIV 
proteins and epitopes without the attendant risks associated with those 
methods. The immune responses generated are effective at recognizing 
HIV, at inhibiting replication of HIV, at identifying and killing cells 
infected with HIV, and are cross-reactive against many HIV strains. 
Therefore, this invention provides a useful immunogen against HIV. 
The invention also provides polynucleotide constructs which enable the 
co-expression, in vivo , of more than one gene-product in a single cell. 
This is demonstrated with an HIV gene expression system in which the 
expression of a first gene is dependent on the co-expression in the same 
cell of a second gene product. By virtue of the success of achieving this 
co-expression in vivo , it is now predictable that this type of 
polynucleotide construct may be applied to co-expression in vivo of 
many combinations of gene products, including but not limited to viral 
antigens other than HIV related antigens, carcinoma-associated antigens, 
and immunomodulatory or immunostimulatory gene products. 
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SUMMARY OF THF IN VFNTFONi 

Nucleic acids, including DNA constructs and RNA 
transcripts, capable of inducing coordinate expression of two to three 
cistrons upon direct introduction into animal tissues, are presented. In 
one embodiment, coordinate expression of two cistrons encoding HIV 
proteins and elicitation of HIV specific immune responses against more 
than one gene products is demonstrated. Cytotoxic T lymphocytes 
(CTLs) specific for viral antigens which respond to different strains of 
human immunodeficiency virus (HIV), and antibodies which are 
generally strain-specific are generated. The generation of such CTLs in 
Yivo usually requires endogenous expression of the antigen, as in the 
case of virus infection. To generate a viral antigen for presentation to 
the immune system, without the limitations of direct peptide delivery or 
the use of viral vectors, polynucleotides encoding HIV proteins are 
directly introduced into tissues of vertebrates in vivo, the 
polynucleotides are taken up by cells within the tissue, and the encoded 
proteins produced and processed for presentation to the immune system. 
In mice, this resulted in the generation of HIV-specific CTLs and 
antibodies. Similar results are achieved in primates. These results are 
achieved with bi- or tri-cistronic nucleic acid polynucleotides encoding 
and co-expressing HIV gene products, immunostimulatory gene 
products including but not limited to GM-CSF, interleukins, interferon 
and B7 proteins, which act as T-cell costimulatory elements. The 
methods and polynucleotides of this invention are generally applicable 
to co-ordinate expression in yivp_ of any two or three genes. Thus, 
various embodiments of this invention include coordinate expression in 
yjv£ of viral antigens and immunostimulatory gene products as well as 
coordinate expression of tumor antigens and immunostimulatory genes. 

BRIEF DESC RIPTION OF THE DRAWTNr.S 
Fig. 1 . A schematic representation of the HIV genome. 
Fig. 2. A schematic representation of a polynucleotide construct of 
this invention capable of inducing the co-ordinate expression 
in vivo in a single cell of up to three gene products encoded 
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by each of three cistrons (I. II. and III). The segments A and 
B represent control sequences including transcription 
termination signals and promoters or internal ribosome entry 
sites (IRES). 

Fig. 3. Detailed schematic of an HIV env polynucleotide immunogen 
construct comprising the CMV-intA transcription promoter, a 
5'-splice donor, HIV gp!60 (showing gpI20, gp41, and the 
REV-responsive element, RRE), an internal ribosome entry 
site (IRES), the REV cistron, the BGH transcription 
terminator, and the neomycin resistance marker which is 
driven by a prokaryotic transcription promoter.. 

Fig. 4. Detailed schematic of dicistronic HIV env and gag 

polynucleotide immunogen constructs showing specific 
regulatory elements. 

Fig. 5. Western blot analysis of gpl60 expression induced by HIV 

polynucleotide immunogens. This result rigorously shows the 
coexpression in a single cell of more than one gene product 
from a single polynucleotide construct: A polynucleotide 
encoding gpl60 alone (see panel B, fourth lane from the left) 
expresses no detectable gp!60, but with REV added in trans 
(by cotransfection of a construct encoding only REV), there is 
good gpl60 expression (panel A, fourth lane from the left). 
A genomic tat/REV/env construct expresses only low levels of 
gpl60, whether or not REV is provided in trans (panels A 
and B, third lane). However, a dicistronic gpl60/IRES/REV 
construct heavily expresses gpl60 (panels A and B, fifth lane 
from the left). The best expression, is obtained in a 
dicistronic construct encoding gpl60/IRES/REV, with a splice 
donor (SD) provided 5' to the gpl60 coding sequence (panels 
A and B, right hand lane). Because no additional expression 
is achieved when additional REV is provided in trans (panel A 
right hand lane), the system is not limited by the level of REV 
being expressed. 
Fig. 6. VI J Sequence. 
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Fig. 7. VUneo Sequence. 

Fig. S. CMVintABGH Sequence. 

Fig. 9. Cytotoxic T lymphocytes generated in rhesus monkeys in 

response to VU-SIV-p28 polynucleotide construct vaccination 
(REV independent). This SIV p28 is equivalent to p24 gag of 
HJV. Thus, CTLs specific to a group specific antigen are 
inducible using a gag encoding polynucleotide construct. 

Fig. 1 0. Cytotoxic T lymphocytes generated in response to Vaccinia- 
SIVp28 nucleic acid vaccination. This demonstrates that 
similar CTLs are induced by a gag encoding polynucleotide 
(figure 9) as compared with a replicating antigen (vaccinia) 
expressing the same antigen [see Shen, L., et aL, Science 
252:440-443, 1991]. 

Fig .11. Sequence of the Vector VI R. 

Fig. 12. Antibodies induced by VUns-tPA-gpl 20, 200 |ig/mouse per 
round. 2 rounds. 

Fig. 13. Neutralization of HIV-1 (MN) virus by sera from VUns- 
tPA-gpl20 (MN) DNA vaccinated African Green Monkeys. Panels a 
and B show the reduction in p24 gag protein production for C8I66 cells 
infected with HIV-1 (MN) following exposure to the indicated dilutions 
of sera from VI Jns-tPA-gpl20 DNA vaccinated monkeys. Data was 
obtained after 10 days in tissue culture following virus inoculation 
(TCID50 per sample). 

Fig. 14 T cells from VI Jns-tPA-gpI20 vaccinated mice exhibiting 
long-term, antigen-specific T lymphocyte memory responses. 
Immunized mice received 1.6 meg of vaccine DNA twice, six months 
prior to sacrifice. Splenic T cells were cultured in vitro with 
recombinant gpl20 protein at 5 mcg/mL. Proliferation of gpl20- 
specific T cells. A stimulation index (SI; incorporated ^H-thymidine 
for gpl20 treatedT cells:T cells that did not receive antigen). 
Fig. 15. Type 1 T helper (ThD lymphocyte cytokine secretion by T 
cells from VUns-tPA-gpl20 DNA vaccinated mice. Cell culture 
supernatants from the samples shown in Figure 13 were assayed from 
gamma-interferon and interleukin 4 (IL-4) secretion following 



WO 95/24485 



PCT/LS95/02633 



10 



15 



20 



25 



30 



treatment with rgp!20. Immune mice secreted large amounts of 
gamma- interferon and very low amounts of 1L-4 indicated that THl- 
Hke responses were induced by this vaccine. Control mice showed very 
low amounts of interferon secretion while the IL-4 levels indicated are 
background levels. 

Fig. 15 Anti-gp 120 cytotoxic T lymphocyte (CTL) activities in 
VUns-tPA-gpl20 DNA vaccinated mice. Two mice (2006 and 2008) 
showed MHC I restricted CTL activities specific to a gpl20 peptide 
(PI 8) following gpl20 DNA vaccinations. No activities were observed 
for these mice in the absence of PI 8 or by a control mouse which had 
not been previously vaccinated. 

Fig. 16. Anti-gp 160 CTL activities by rhesus monkeys vaccinated 
with VlJns-gpl60/IRES/rev and V Uns-tPA-gpI20 DNA vaccines. T 
lymphocyte cultures from all four monkeys receiving these vaccines 
showed MHC I restricted killing of autologous target cells that had been 
treated with vaccinia-gp!60. No CTL activity was observed in four 
control rhesus that had been immunized with blank' DNA vaccine 
(VUns without a gene insert). 

DETAILED DESCRIPTION OF THE INVENTION 

Nucleic acids, including DNA constructs and RNA 
transcripts, capable of inducing coordinate expression of two to three 
cistrons upon direct introduction into animal tissues, are presented. In 
one embodiment, coordinate expression of two cistrons encoding HTV 
proteins and elicitation of HTV specific immune responses against more 
than one gene products is demonstrated. Cytotoxic T lymphocytes 
(CTLs) specific for viral antigens which respond to different strains of 
human immunodeficiency virus (HIV), and antibodies which are 
generally strain-specific are generated. The generation of such CTLs in 
vivo usually requires endogenous expression of the antigen, as in the 
case of virus infection. To generate a viral antigen for presentation to 
the immune system, without the limitations of direct peptide delivery or 
the use of viral vectors, polynucleotides encoding HIV proteins are 
directly introduced into tissues of vertebrates in vivo , the 
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polynucleotides are taken up by cells within the tissue, and the encoded 
proteins produced and processed for presentation to the immune system. 
In mice, this resulted in the generation of HlV-specific CTLs and 
antibodies. Similar results are achieved in primates. These results are 
achieved with bi- or tri-cistronic nucleic acid polynucleotides encoding 
and co-expressing HIV gene products, immunostimuiatory gene 
products including but not limited to GM-CSF, interleukins, interferon 
and B7 proteins, which act as T-cell costimulatory elements. The 
methods and polynucleotides of this invention are generally applicable 
to co-ordinate expression in vivo of any two or three genes. Thus, 
various embodiments of this invention include coordinate expression in 
vivo of viral antigens and immunostimuiatory gene products as well as 
coordinate expression of tumor antigens and immunostimuiatory genes. 

This invention provides polynucleotides which, when 
directly introduced into a vertebrate in vivo , including mammals such as 
primates and humans, induces the expression of encoded proteins within 
the animal. 

As used herein, a polynucleotide is a nucleic acid which 
contains essential regulatory elements such that upon introduction into a 
living vertebrate cell, is able to direct the cellular machinery to produce 
translation products encoded by the genes comprising the 
polynucleotide. 

In one embodiment of the invention, the polynucleotide is a 
polydeoxyribonucleic acid comprising HIV genes operatively linked to a 
transcriptional promoter. In another embodiment of the invention, the 
polynucleotide vaccine comprises polyribonucleic acid encoding HIV 
genes which are amenable to translation by the eukaryotic cellular 
machinery (ribosomes, tRNAs, and other translation factors). Where 
the protein encoded by the polynucleotide is one which does not 
normally occur in that animal except in pathological conditions, (i.e. an 
heterologous protein) such as proteins associated with human 
immunodeficiency virus, (HIV), the etiologic agent of acquired immune 
deficiency syndrome, (AIDS), the animals' immune system is activated 
to launch a protective immune response. Because these exogenous 
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proteins are produced by the animals own tissues, the expressed 
proteins are processed by the major histocompatibility system, MHC, in 
a fashion analogous to when an actual infection with the related 
organism. HIV. occurs. The result, as shown in this disclosure, is 
induction of immune responses against the cognate pathogen. 

Accordingly, the instant inventors have prepared nucleic 
acids which, when introduced into the biological system induce the 
expression of HIV proteins and epitopes. The induced antibody 
response is both specific for the expressed HIV protein, and neutralizes 
HIV. In addition, cytotoxic T-lymphocytes which specifically recognize 
and destroy HIV infected cells are induced. The instant inventors have 
also developed polynucleotides whereby simian immunodeficiency virus 
(SIV) genes are efficiently expressed upon introduction in vivo . This 
achievement is significant because the only animal model closely 
mimicking the human disease, ADDS, is the subhuman primate model 
utilizing SIV. Thus, efficacy of the instant immunogens as vaccines can 
be shown by analogy to the effects obtained in vivo utilizing HIV and 
SIV polynucleotide immunogens. 

There are many embodiments of the instant invention 
which those skilled in the art can appreciate from the specifics taught 
herein. Thus, different transcriptional promoters, terminators, carrier 
vectors or specific gene sequences may be used successfully based on the 
successful invention disclosed herein. 

The instant invention provides a method for using a 
polynucleotide which, upon introduction into mammalian tissue, induces 
the co-expression in a single cell, in vivo , of two or more different, 
discrete gene products. The method is exemplified by using an HIV 
model which demonstrates the co-expression of more than one gene 
product in a single cell upon introduction of the polynucleotide into 
mammalian tissue in vivo . The model is stringent because certain HIV 
genes contain a sequence known as the REV responsive element (RRE). 
These genes are not efficiently expressed unless another HIV gene, 
known as REV, is also present within the cell expressing the RRE- 
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containing HIV gene. This phenomenon is described as REV 
dependence. 

PavJakis and Felber, WO 93/20212 have described a 
method of eliminating sequences which may induce transcript 
instability, which may also achieve some REV independence of certain 
HIV genes. That method may not be generally applicable to all such 
genes, is time-consuming and may require multiple gene modifications. 
Furthermore, the level of expression and immunogenicity of such genes 
may be compromised by elimination of the REV dependence. 

The instant invention provides a different solution which 
does not require multiple manipulations of REV dependent HIV genes to 
obtain REV-independence. In addition, the instant invention is 
applicable to expression of REV independent genes as well as to 
expression of REV dependent genes. The REV-dependent expression 
system described herein, is useful in its own right and is also useful as a 
stringent system for demonstrating the co-expression in a single cell in 
vivo of more than a single desired gene-product. Thus, in any 
circumstance in which it is beneficial to achieve the co-expression, 
within a given cell jn vivo, of more than a single gene product, the 
methods and polynucleotide constructs described herein may be 
employed. 

One situation, exemplified herein, is the co-expression of 
an immunogenic epitope and a member of the family of T-cell 
recognition elements known as B7. Recently, Steven Edgington 
[Biotechnology JJL:11 17-1119, 1993] reviewed the coordinate roles of 
B7 and the major histocompatibility complex (MHC) presentation of 
epitopes on the surface of antigen presenting cells in activating CD8+ 
CTLs for the elimination of rumors. Once a MHC molecule on the 
surface of an antigen presenting cell (APC) presents an epitope to a T- 
ceil receptor (TCR), B7 expressed on the surface of the same APC acts 
as a second signal by binding to CTLA-4 or CD28. The result is rapid 
division of CD4+ helper T-cells which signal CD8+ T-cells to 
proliferate and kill the APC. Thus, our demonstration herein of 
efficient expression and production of immune responses against an HIV 
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REV dependent gene containing an RRE by coordinate!)' expressing a 
gene lor REV, conclusively proves that more than one gene can be co- 
ordinately expressed by introducing a polynucleotide encoding two and 
even three cistrons (defined as a stretch of nucleic acid that carries the 
information for a polypepide chain). 

Because many of the applications of the instant invention 
apply to anti-viral vaccination, the polynucleotides are frequently 
referred to as a polynucleotide vaccine (PNV). This is not to say that 
additional utilities of these polynucleotides, in immune stimulation and 
in anti -tumor therapeutics, is to be ignored or considered to be outside 
the scope of the invention. 

In one embodiment of this invention, a gene encoding an 
HIV gene product is incorporated in an expression vector. The vector 
contains a transcriptional promoter recognized by an eukaryotic RNA 
polymerase, and a transcriptional terminator at the end of the HIV gene 
coding sequence. In a preferred embodiment, the promoter is the 
cytomegalovirus promoter with the intron A sequence (CMV-intA), 
although those skilled in the art will recognize that any of a number of 
other known promoters such as the strong immunoglobulin, or other 
eukaryotic gene promoters may be used. A preferred transcriptional 
terminator is the bovine growth hormone terminator. The combination 
of CMVintA-BGH terminator (Fig. 8, SEQ. ID: 13:) is particularly 
preferred. In addition, to assist in preparation of the polynucleotides in 
prokaryotic cells, an antibiotic resistance marker is also preferably 
included in the expression vector under transcriptional control of a 
prokaryotic promoter so that expression of the antibiotic does not occur 
in eukaryotic cells. Ampicillin resistance genes, neomycin resistance 
genes or any other pharmaceutically acceptable antibiotic resistance 
marker may be used. In a preferred embodiment of this invention, the 
antibiotic resistance gene encodes a gene product for neomycin 
resistance. Further, to aid in the high level production of the 
polynucleotide by fermentation in prokaryotic organisms, it is 
advantageous for the vector to contain a prokaryotic origin of 
replication and be of high copy number. Any of a number of 
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commercially available prokaryoric cloning vectors provide these 
benefits. In a preferred embodiment of this invention, these 
functionalities are provided by the commercially available vectors 
known as pUC. It is desirable, however, to remove non-essential DNA 
sequences. Thus, the lacZ and lad coding sequences of pUC are 
removed in one embodiment of the invention. It is also desirable that 
the vectors not be able to replicate in eukaryotie cells. This minimizes 
the risk of integration of polynucleotide vaccine sequences into the 
recipients' genome. 

In another embodiment, the expression vector pnRSV is 
used, wherein the Rous Sarcoma virus (RSV) long terminal repeat 
(LTR) is used as the promoter. In yet another embodiment, VI, a 
mutated pBR322 vector into which the CMV promoter and the BGH 
transcriptional terminator were cloned is used. In a particularly 
preferred embodiment of this invention, the elements of VI and pUCI9 
have been combined to produce an expression vector named VI J (SEQ. 
ID:12:). Into V1J or another desirable expression vector is cloned an 
HIV gene, such as gpI20, gp41, gpl60, gag, pol, env. or any other HIV 
gene which can induce anti-HIV immune responses (antibody and/or 
CTLs). Exclusion of functional reverse transcriptase and integrase 
functions encoded by the HTV genome is desirable to minimize the risk 
of integration of the polynucleotide vaccine encoded sequences into the 
recipients' genome. In another embodiment, the ampicillin resistance 
gene is removed from VI J and replaced with a neomycin resistance 
gene, to generate VU-neo (SEQ. ID: 14:), into which any of a number of 
different HIV genes have been cloned for use according to this 
invention. In yet another embodiment, the vector is VUns, which is the 
same as VUneo except that a unique Sfil restriction site has been 
engineered into the single Kpnl site at position 2114 of VI J-neo. The 
incidence of Sfil sites in human genomic DNA is very low 
(approximately 1 site per 100,000 bases). Thus, this vector allows 
careful monitoring for expression vector integration into host DNA, 
simply by Sfil digestion of extracted genomic DNA. In a further 
refinement, the vector is V1R. In this vector, as much non-essential 
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DNA as possible was "trimmed" from the vector to produce a highK 
compact vector. This vector is a derivative of V Uns and is shown in 
Figure 1 1, (SEQ.ID.: 100:). This vector allows larger inserts to be used, 
with less concern that undesirable sequences are encoded and optimizes 
uptake by cells when the construct encoding specific influenza virus 
genes is introduced into surrounding tissue. In figure 1 1, the portions 
of VUneo (Figure 7) that are deleted are shown as a gap, and inserted 
sequence is in bold text, but the numbering of VUneo is unchanged. 
The foregoing vector modification and development procedures may be 
accomplished according to methods known by those skilled in the an. 
The particular products described however, though obtained by 
conventional means, are especially useful for the particular purpose to 
which they are adapted. 

One embodiment of this invention incorporates genes 
encoding HIV gpl60, gp!20, gag and other gene products from such 
well known laboratory adapted strains of HIV as SF2, IDB or MN, for 
which a great deal of data has been generated, for example, such as 
showing that chimpanzees can be protected from a lethal challenge of 
HIV IIIB virus by first administering HIV Illb V3 loop specific 
monoclonal antibody [Emini et al.. Nature 355 : 728-730 1992], or by 
vaccination with recombinant gpl20 but not gpl60 [Berman et al., 
Nature 345 ■ 822-825, 1990]. Those skilled in the art will recognize 
that the use of genes from HIV-2 strains having analogous function to 
the genes from HIV-1 would be expected to generate immune responses 
analogous to those described herein for HIV-1 constructs. The cloning 
and manipulation methods for obtaining these genes are well known to 
those skilled in the art. 

There has recently been recognition that elicitation of 
immune responses against laboratory adapted strains of HIV may not be 
adequate to provide neutralization of primary, field isolates of HIV, [see 
for example Cohen, J., Science 262: 980-981, 1993]. Thus, in another 
embodiment of this invention, genes from virulent, primary field 
isolates of HIV are incorporated in the polynucleotide immunogen. 
This is accomplished by preparing cDNA copies of the viral genes and 
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then subclonmg the individual genes into the polynucleotide 
immunogen. Sequences for many genes of many HIV strains are now 
publicly available on GENBANK and such primary, filed isolates of 
HIV are available from the National Institute of Allergy and Infectious 
Diseases (NIAID) which has contracted with Quality Biological, Inc., 
[7581 Lindbergh Drive, Gaithersburg, Maryland 20879] to make these 
strains available. Such strains are also now available from the World 
Health Organization (WHO) [Network for HIV Isolation and 
Characterization, Vaccine Development Unit, Office of Research, 
Global Program on AIDS, CH-121 1 Geneva 27, Switzerland]. From 
this work those skilled in the art will recognize that one of the utilities 
of the instant invention is to provide a system for in vivo as well as in 
vitro testing and analysis so that a correlation of HIV sequence diversity 
with serology of HIV neutralization, as well as other parameters can be 
made. The isolation and cloning of these various genes may be 
accomplished according to methods known to those skilled in the art. 
Thus this invention further provides a method for systematic 
identification of HIV strains and sequences for vaccine production. 
Incorporation of genes from primary isolates of HIV strains provides an 
immunogen which induces immune responses against clinical isolates of 
the virus and thus meets a need as yet unmet in the field. Furthermore, 
as the virulent isolates change, the immunogen may be modified to 
reflect new sequences as necessary. 

To keep the terminology consistent, the following 
convention is followed herein for describing polynucleotide immunogen 
constructs: 

"Vector name-HIV strain-gene-additional elements". Thus, 
a construct wherein the gpl60 gene of the MN strain is cloned into the 
expression vector VI Jneo, the name it is given herein is: "VIJneo-MN- 
gp!60". The additional elements that are added to the construct are 
described in further detail below. Naturally, as the etiologic strain of 
the virus changes, the precise gene which is optimal for incorporation in 
the pharmaceutical may be changed. However, as is demonstrated 
below, because cytotoxic lymphocyte responses are induced which are 



WO 95/24485 



PCm'S95/02633 



capable of protecting against heterologous strains, the strain variability 
is less critical in the immunogen and vaccines of this invention, as 
compared with the whole virus or subui polypeptide based vaccines. 
In addition, because the pharmaceutical is easily manipulated to insert a 
new gene, this is an adjustment which is easily made by the standard 
techniques of molecular biology. 

To provide a complete description of the instant invention, 
the following background on HIV is provided. The human 
immunodeficiency virus has a ribonucleic acid (RNA) genome, the 
structure of which is represented in Figure I. This RNA genome must 
be reverse transcribed according to methods known in the art in order 
to produce a cDNA copy for cloning and manipulation according to the 
methods taught herein. At each end of the genome is a long terminal 
repeat which acts as a promoter. Between these termini, the genome 
encodes, in various reading frames, gag-pol-env as the major gene 
products: gag is the group specific antigen; pol is the reverse 
transcriptase, or polymerase; also encoded by this region, in an alternate 
reading frame, is the viral protease which is responsible for post- 
radiational processing, for example, of gpl60 into gpl20 and gp41; 
env is the envelope protein; vif is the virion infectivity factor; REV is 
the regulator of virion protein expression; neg is the negative 
regulatory factor; vpu is the virion productivity factor "u"; tat is the 
trans-activator of transcription; vpr is the viral protein r. The function 
of each of these elements has been described ( see AIDS 89, A Practical 
Synopsis of the V International Conference, June 4-9, 1989, Montreal, 
A Philadelphia Sciences Group Publication, from which figure 1 was 
adapted). 

In one embodiment of this invention, a gene encoding an 
HIV or SIV protein is directly linked to a transcriptional promoter. 
The env gene encodes a large, membrane bound protein, gpl60, which 
is post-translationally modified to gp41 and gpI20. The gpl20 gene 
may be placed under the control of the cytomegalovirus promoter for 
expression. However, gpl20 is not membrane bound and therefore, 
upon expression, it may be secreted from the cell. As HIV tends to 
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remain dormant in infected cells, it is desirable that immune responses 
directed at cell-bound HIV epitopes also be generated. This goal is 
accomplished herein by expression in vivo of the cell-membrane 
associated epitope, gp 1 60. to prime the immune system. However, 
expression of gpl60 is repressed in the absence of REV due to non- 
export from the nucleus of non-spliced genes. For an understanding of 
this system, the life cycle of HIV must be described in further detail. 

In the life cycle of HIV, upon infection of a host cell, HIV 
RNA genome is reverse-transcribed into a proviral DNA which 
integrates into host genomic DNA as a single transcriptional unit. The 
LTR provides the promoter which transcribes HIV genes from the 5' to 
3' direction (gag, pol, env), to form an unspliced transcript of the entire 
genome. The unspliced transcript functions as the rnRNA from which 
gag and pol are translated, while limited splicing must occur for 
translation of env encoded genes. For the regulatory gene product REV 
to be expressed, more than one splicing event must occur because in the 
genomic setting, REV and env, as is shown in figure 1 , overlap. In 
order for transcription of env to occur, REV transcription must stop, 
and vice versa. In addition, the presence of REV is required for export 
of unspliced RNA from the nucleus. For REV to function in this 
manner, however, a REV responsive element (RRE) must be present on 
the transcript [Malim et al. f Nature 338:254-257 (1989)]. 

In the polynucleotide vaccine of this invention, the 
obligatory splicing of certain HIV genes is eliminated by providing fully 
spliced genes (i.e.: the provision of a complete open reading frame for 
the desired gene product without the need for switches in the reading 
frame or elimination of noncoding regions; those of ordinary skill in 
the art would recognize that when splicing a particular gene, there is 
some latitude in the precise sequence that results; however so long as a 
functional coding. sequence is obtained, this is acceptable). Thus, in one 
embodiment, the entire coding sequence for gpl60 is spliced, and the 
sequence of REV is spliced, such that no intermittent expression of each 
gene product is required. Furthermore, the features of REV regulated 
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expression are exploited to optimize expression of HIV encoded REV- 
dependent. immunogenic gene products. 

For REV to function as an exporter of transcripts from the 
nucleus to be translated in the cytoplasm. REV requires, in addition to 
the presence of a REV responsive element (RRE) on the transcript to be 
exported, at least one splice donor site on the 5' side of the gene 
containing the RRE [Lu et al., P.N.A.S. USA 87:7598-7602, (October 
1990); Chang and Sharp, Cell 59:789-795 (December 1, 1989)]. The 
instant inventors conceived polynucleotides providing the REV coding 
sequence in a location on the same expression vector as the gene to be 
expressed such that co-expression of REV and the REV responsive gene 
occur without the need for any splicing. Thus, in a preferred 
embodiment of this invention, HIV genes are placed immediately 
downstream from a transcriptional promoter, such as the CMV 
promoter, and the spliced REV coding sequence is placed at a location 
3' to (also referred to as downstream from) the first coding sequence. 
Naturally, the order of these genes could be changed. However, it may 
be preferable to have the immunogenic HIV cistron abut directly to the 
2q transcriptional promoter to ensure that all transcripts produced encode 
the entire cistron. 

One method for achieving co-expression of genes relies on 
co-transfection of cells in culture with different vectors expressing 
different genes. For a REV dependent gene, the REV gene product 
2s could be provided in this manner in trans. However, this is suboptimal 
for the purposes of this invention, although not outside the scope of the 
instant invention, because of the low probability that co-transfection of a 
given ceil would occur in vivo so as to achieve the necessary availability 
of REV for vigorous expression of REV dependent immunogenic HIV 
gene products. Another method is to provide several promoters on a 
given vector, each promoter controlling expression of a separate gene. 
This amounts to providing REV gene product in cis. This solution may 
be employed according to the instant invention. In such an embodiment, 
it would be preferable for the various promoters and the genes they 
control to run in opposite directions. However, because of the Icnown 
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competitive interference between promoters in this type of multiple 
gene vector, this embodiment is also considered sub-optimal. 

Ghattas et al.. [Mol. and Cell. Biol. JJ_. No. 12:5848-5859 
(Dec. 199!)]. Kaufaman et al. [Nuc. Acids Res. 19. No. 16:4485-4490 
5 (1991)]. and Davies [J. Virol. 66, No. 4:1924-1932 (Apr. 1992)] have 
described an internal ribosome entry sue (IRES) in the 
encephalomyocarditis virus (EMCV) leader. They reported that a 
system in which an upstream promoter could be used to initiate 
transcription of a dicistronic mRNA provides good expression of both 
the 5 and ?' open reading frames when an IRES is located between the 
two genes. Chen et al. (J. ViraL, 67 : 2142-2145. 1993] have reported a 
system in which the 5 nontranslated region (NTR) from swine vesiculor 
disease virus (SVDV) was used to construct a bicistronic virus for the 
coexpression of two genes from one transcript from an infectious viral 
vector. 

The instant inventors have discovered that a nucleic acid 
construct which incorporates coordinated expression of an HIV gene 
containing a REV responsive element (RRE), an internal ribosome entry 
site (IRES) and a REV coding sequence results in efficient expression of 
both REV and the REV dependent gene product. This embodiment of 
the invention is better understood with reference to figures 2 and 3. 
Fig. 2 shows a generalized embodiment while. Figure 3, shows a 
specific embodiment of this invention which, according to the 
nomenclature system described above, is V 1 Jns-gpI60(RRE)-IRES- 
REV. The strain of HIV from which the immunogenic HIV gene is 
derived is irrelevant for the illustrative purposes of this discussion, and 
indeed, the expression of any REV dependent gene product is 
predictably efficient, as is the elicitation of immune responses against 
both REV and the REV dependent gene product, based on the instant 
patent disclosure. According to the embodiment shown in Fig. 3, the 
vector is VUns, described above. Thus, the promoter (CMVintA) and 
terminator (BGH) are provided for by the vector, along with a 
prokaryotic origin of replication, to facilitate large scale production of 
the HIV polynucleotide vaccine through fermentation of bacteria 
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transformed with the construct, according to methods well known in the 
art. This construct does not replicate in eukaryotic tissue, due to the 
absence of an eukaryotic origin of replication. A splice donor site from 
the naturally occurring rev/tat splice donor is provided (rev/tat SD) 
immediately preceding the HIV gene. The gag/pol/env coding sequence 
contains or is followed by a REV responsive element (RRE) which, 
upon formation of the nascent transcript, provides the necessary signals 
for REV binding to and export of the REV dependent mRNA from the 
nucleus. Next, there are sequences provided for reinitiation of 
translation at the internal ribosome entry site f IRES) so that the 
downstream REV coding sequence is efficiently translated. In this 
manner, REV gene product is provided in cis, on the same 
polynucleotide as a REV dependent gene product. 

In further refinements to the instant invention, a third 
cistron may be included in the PNV. The genes encoding such 
immunostimulatory proteins as the B7-antigen presenting cell-surface 
protein, the human granulocyte/monocyte colony stimulatory factor 
(GM-CSF) gene, and cytokine genes such as interleukin and interferon, 
the use of tissue-specific transcriptional promoters and enhancers, are 
all contemplated. The provision of B7 or GM-CSF gene in cis, either 
by insertion of an IRES after REV and before the B7 gene, by provision 
of a second promoter on the same vector construct as the dicistronic 
REV-dependent HIV gene, IRES-REV construct, or in trans using a 
separate construct are all envisioned by extension of the foregoing 
teachings regarding REV and REV dependent genes. The generalized 
immuno-stimulatory effect of these gene products may be sufficient 
even if provided in trans to enhance immune responses against the HIV 
gene products encoded by the immunogen of this invention. It is 
preferable, particularly for B7, that the same cell presenting HIV 
epitopes in the cleft of MHC-I molecules also present B7. This co- 
presentation of both the antigenic epitope and B7 "closes" the switch 
necessary for T-cell activation. Cytokines, particularly IL-12, which 
modifies whether a predominant humoral or cellular immune response 
is mounted [see Afonso et al.. Science 263 :235-237. 1994], either is 
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provided intravenously at the same time that PNV is introduced, or is 
included as a third cistron in the PNV, thereby assuring localized 
production of the interleukin. The genes for these immunostirnulatory 
and immunoregulatory proteins, including GM-CSF (see Shaw and 
Kamen, Cell 46:659-667, 1986 ). interleukin- 1 2 (see Wolf, S., et aL, L 
Immunol. 146 :3074-3081, 1991) and B7, (see Gordon et ah, L 
Immunol. 143 :2714-2722, 1989; for clones and sequences of newer 
members of the B7 family of proteins, see also Azuma, M. et al., 
Nature 366:76-79, 1 993; and Freeman, G., et al., Science 262:909-9 1 i . 
1993) are known and easily cloned and incorporated in PNV's 
according to this invention using methods known to the skilled 
practitioner. Preferably, the genes used for these purposes are the 
human genes so that immune responses against these proteins are 
minimized, allowing the expressed proteins to carry out their 
immunomodulatory and immunostirnulatory functions. Where HIV 
genes have been rendered REV-independent, the REV cistron may be 
eliminated completely and a second cistron encoding a B7 gene family 
member and a third cistron encoding yet another gene-product such as 
IL-12, may be constructed. 

The use of tissue-specific promoters or enhancers, for 
example the muscle creatine kinase (MCK) enhancer element, is 
desirable whenever it is desirable to limit expression of the 
polynucleotide to a particular tissue type. For example, myocytes are 
terminally differentiated cells which do not divide. Integration of 
foreign DNA into chromosomes appears to require both cell division 
and protein synthesis. Thus, limiting protein expression to non- 
dividing cells such as myocytes is preferable. However, use of the CMV 
promoter is adequate for achieving expression in many tissues into 
which the PNV is introduced. 

In the various embodiments of this invention which are 
described below, the basic paradigm described above is used. 
Deviations, additions or subtractions from this basic construction design 
serve to hi- light the various aspects of this invention. 
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This patent disclosure exemplifies bi- or tn-cistronic HIV 
polynucleotide immunogens as polynucleotide vaccines, PNVs, to 
generate humoral immunity as well as cross-strain cellular antiviral 
immunity. The system is useful, however, for any two or three 
cistrons, whether or not related to HIV, when co-expression of the 
encoded gene products in a single celi in vivo is required. However, the 
dual humoral and cellular immune responses generated according to this 
invention are particularly significant to inhibiting HIV infection, given 
the propensity of HIV to mutate within the infected population, as well 
as in infected individuals. In order to formulate an effective protective 
vaccine for HIV it is desirable to generate both a multivalent antibody- 
response for example to gpl60 (cnv is approximately 80% conserved 
across various HIV-1, clade B strains, which are the prevalent strains in 
US human populations), the principal neutralization target on HIV, as 
well as cytotoxic T cells reactive to the conserved portions of gp!60 
and, internal viral proteins encoded by gag. We have made an HIV 
vaccine comprising gpl60 genes selected from common laboratory 
strains; from predominant, primary viral isolates found within the 
infected population; from mutated gpl60s designed to unmask cross- 
strain, neutralizing antibody epitopes; from other representative HIV 
genes such as the gag gene (>95% conserved across HIV isolates); and 
from SIV, which provides an animal model for testing the HIV PNV 
wherein non-human primates can be immunized and challenged to test 
viral load and progression to disease. 

Virtually all HIV seropositive patients who have not 
advanced towards an immunodeficient state harbor ami-gag CTLs while 
about 60% of these patients show cross-strain, gpl60-specific CTLs. 
The amount of HIV specific CTLs found in infected individuals that 
have progressed on to the disease state known as AIDS, however, is 
much lower, demonstrating the significance of our findings that we can 
induce cross-strain CTL responses. Because HTV late gene expression is 
tfEVdependent our gpl60 and gag vaccination vectors are designed to 
also produce REV (-90% conserved), to facilitate the REV-dependent 
gene expression. An additional benefit of this invention is that anti- 
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REV immune responses are also generated. This gives further 
advantage to our vaccine because REV is made in large quantities very 
early following infection of a cell, and hours in advance of synthesis of 
the late gene products, thereby providing an earlier opportunity for 
intervention by vaccine-induced T-cell responses including CTLs and T- 
helper cells. 

In a further embodiment of this invention, a cocktail 
vaccine is prepared in which different HIV REV-dependent gene 
constructs are mixed together to generate anti-REV CTL responses in 
addition to antibodies and CTL against the immunogenic HIV REV- 
dependent gene products. According to this embodiment, one 
polynucleotide encoding gpl60, followed by REV, followed by B7, in a 
tri-cistronic construct having one promoter and two IRES sequences, is 
mixed with another polynucleotide encoding a gene product, REV, 
and B7 or another immunomodulatory or immunostimulatory gene 
product such as IL-12 or GM-CSF. In this fashion, with a single or 
several injections of polynucleotide, immune responses against several 
HIV related imrnunogens can be raised. Likewise, one polynucleotide 
comprising a REV independent gene product, such as those described in 
WO 93/20212, B7, and another immunomodulatory or 
immunostimulatory gene, such as IL-12 or GM-CSF, are mixed with 
another REV-dependent, or REV-independent bi- or tri-cistronic 
expression construct. Furthermore, multiple bi- or tri-cistronic 
constructs encoding HIV or other antigens could be prepared and mixed 
to produce a multivalent combination polynucleotide vaccine. 

Immune responses induced by our env , REV, and gag 
polynucleotide vaccine constructs are demonstrated in mice, rabbits, and 
primates. Monitoring antibody production to env in mice allows 
confirmation that a given construct is suitably immunogenic, i.e., a high 
proportion of vaccinated animals show an antibody response. Mice also 
provide the most facile animal model suitable for testing CTL induction 
by our constructs and are therefore used to evaluate whether a 
particular construct is able to generate such activity. However, mouse 
cell lines have been observed to not support efficient REV or tat 
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functions. This observation was made in the context of HIV LTR 
driven expression of late genes and a limited amount of data indicates 
that heterologous promoters allow REV function in mouse cells. 
Rabbits and monkeys (African Green, rhesus, chimpanzees) provide 
additional species including primates for antibody evaluation in larger, 
non-rodent animals. These species are also preferred to mice for 
antisera neutralization assays due to high levels of endogenous 
neutralizing activities against retroviruses observed in mouse sera. 
These data demonstrate that sufficient immunogenicity is engendered by 
our vaccines to achieve protection in experiments in a 
chimpanzee/HIViUB challenge model. The currently emerging and 
increasingly accepted definition of protection in the scientific 
community is moving away from so-called "sterilizing immunity", 
which indicates complete protection from HIV infection, to prevention 
of disease. A number of correlates of this goal include reduced blood 
viral titer, as measured either by HIV reverse transcriptase activity, by 
infecrivity of samples of serum, by ELISA assay of p24 or other HIV 
antigen concentration in blood, increased CD4+ T-cell concentration, 
and by extended survival rates [see, for example, Cohen, J., Science 
2£2:1820-1821, 1993, for a discussion of the evolving definition of anti- 
HIV vaccine efficacy]. The immunogens of the instant invention also 
generate neutralizing immune responses against infectious (clinical, 
primary field) isolates of HIV. 

Immunology 

A. Antibody Responses to env. 

1. gp!60 and gp!20 . An ELISA assay is used to determine 
whether vaccine vectors expressing either secreted gpl20 or membrane - 
bound gpl60 are efficacious for production of cm-specific antibodies. 
Initial in vitro characterization of env expression by our vaccination 
vectors is provided by immunoblot analysis of gpl60 transfected cell 
lysates. These data confirm and quantitate gpl60 expression using anti- 
gp41 and anti-gpl20 monoclonal antibodies to visualize transfectant cell 
gpl60 expression. In one embodiment of this invention, gpl60 is 
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preferred to gp 1 20 for the following reasons: (!) an initial epl20 
vector gave inconsistent immunogenicity in mice and was very poorly 
or non-responsive in African Green Monkeys; (2) gp!60 contributes 
additional neutralizing antibody as well as CTL epitopes by providing 
the addition of approximately 190 amino acid residues due to the 
inclusion of gp4I; (3) gpI60 expression is more similar to viral enr 
with respect to tetramer assembly and overall conformation; and (4) we 
find that, like the success of membrane-bound, influenza HA constructs 
for producing neutralizing antibody responses in mice, ferrets, and 
nonhuman primates [see Uimer et al.. Science 259:1745-1 749, 1993; 
Montgomery, D., et al., DNA and Cell Riot. 17-777-7^ 1993] anti- 
gpl60 antibody generation is superior to anti-gpI20 antibody 
generation. Selection of which type of env , or whether a cocktail of 
env subfragments, is preferred is determined by the experiments 
outlined below. 



2. Presence a nd Breadth of Neutralizing Activity . ELISA 
positive antisera from rabbits and monkeys is tested and shown to 
neutralize both homologous and heterologous HIV strains. 

3- V3 vs. non-V3 Neutra lizing Antibodies A major goal 
for env PNVs is to generate broadly neutralizing antibodies. It has now 
been shown that antibodies directed against V3 loops are very strain 
specific, and the serology of this response has been used to define 
strains. 

a. Non-V3 neutralizing antibodies appear to primarily 
recognize discontinuous, structural epitopes within gpl20 which are 
responsible for CD4 binding. Antibodies to this domain are polyclonal 
and more broadly cross-neutralizing probably due to restraints on 
mutations imposed by the need for the virus to bind its cellular ligand. 
An jn vitro assay is used to test for blocking gpl20 binding to CD4 
immobilized on 96 well plates by sera from immunized animals A 
second in vitro assay detects direct antibody binding to synthetic 
peptides representing selected V3 domains immobilized on plastic. 
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These assays are compatible for antisera from any of the animal types 
used in our studies and define the types of neutralizing antibodies our 
vaccines have generated as well as provide an in vitro correlate to virus 
neutralization. 

b. gp41 harbors at least one major neutralization 
determinant, corresponding to the highly conserved linear epitope 
recognized by the broadly neutralizing 2F5 monoclonal antibody 
(commercially available from Viral Testing Systems Corp., Texas 
Commerce Tower, 600 Travis Street, Suite 4750, Houston, TX 77002- 
30O5(USA), or Waldheim Pharmazeutika GmbH, Boltzmangasse 1 1, A- 
1091 Wien, Austria), as well as other potential sites including the well- 
conserved "fusion peptide' 1 domain located at the N-terminus of gp41. 
Besides the detection of antibodies directed against gp41 by immunoblot 
as described above, an in vitro assay test is used for antibodies which 
bind to synthetic peptides representing these domains immobilized on 
plastic. 

4. Maturation of the Antibody Response . In HIV 
seropositive patients, the neutralizing antibody responses progress from 
chiefly anti-V3 to include more broadly neutralizing antibodies 
comprising the structural gpl20 domain epitopes described above (#3). 
including gp41 epitopes. These types of antibody responses are 
monitored over the course of both time and subsequent vaccinations. 

B. T Cell Reactivities Against env . REV, nef and %a%. 

L Generatio n of CTL Responses . Viral proteins which are 
synthesized within cells give rise to MHC I-restricted CTL responses. 
Each of these proteins elicit CTL in seropositive patients. Our vaccines 
also are able to elicit CTL in mice. The immunogenetics of mouse 
strains are conducive to such studies, as demonstrated with influenza 
NP, [see Ulmer et al„ Science 259:1745-1749, 1993]. Several epitopes 
have been defined for the HIV proteins env, REV, nef and gag in 
Balb/c mice, thus facilitating in vitro CTL culture and cytotoxicity 
assays. Additionally, it is advantageous to use syngenic tumor lines, 
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such as the murine mastocytoma PHI 5, transfected with these genes to 
provide targets for CTL as well as for in vitro antigen specific 
restimulation. Methods for defining immunogens capable of elicitine 
MHC class I-restricted cytotoxic T lymphocytes are known [see Calin- 
Laurens, et a!., Vaccine il(9):974-978, 1993: see particularly Eriksson, 
et aL Vaccine, H(8):859-865, 1993, wherein T-cell activating epitopes 
on the HIV gpl 20 were mapped in primates and several regions, 
including gpl20 amino acids 142-192, 296-343, 367-400, and 410-453 
were each found to induce lymphoproliferation; furthermore, discrete 
regions 248-269 and 270-295 were lymphoproliferative. A peptide 
encompassing ammo acids 152-176 was also found to induce HIV 
neutralizing antibodies], and these methods may be used to identify 
immunogenic epitopes for inclusion in the PNV of this invention. 
Alternatively, the entire gene encoding gpl 60, gpl 20, protease, or gag 
could be used. For additional review on this subject, see for example, 
Shirai et aL, J. Immunol 148:1657-1667 1992; Choppin et al., L 
Immunol 147:569-574, 1991; Choppin et al.. J. Immunol 147 :575-583. 
1991; Berzofskv et al.. J. Clin. Invest R8-R7fi.8S4 1QQI As used 
herein, T-cell effector function is associated with mature T-cell 
phenotype, for example, cytotoxicity, cytokine secretion for B-cell 
activation, and/or recruitment or stimulation of macrophages and 
neutrophils. 

2 - Measurement of Th Activities . Spleen cell cultures 
derived from vaccinated animals are tested for recall to specific antigens 
by addition of either recombinant protein or peptide epitopes. 
Activation of T cells by such antigens, presented by accompanying 
splenic antigen presenting cells, APCs, is monitored by proliferation of 
these cultures or by cytokine production. The pattern of cytokine 
production also allows classification of Th response as type 1 or type 2. 
Because dominant TH2 responses appear to correlate with the exclusion 
of cellular immunity in immunocompromised seropositive patients, it is 
possible to define the type of response engendered by a given PNV in 
patients, permitting manipulation of the resulting immune responses. 
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3. Delayed Type Hypersen sitivity (DTH) . DTH to viral 
antigen after i.d. injection is indicative of cellular, primarily MHC II- 
restricted, immunity. Because of the commercial availability of 
recombinant HIV proteins and synthetic peptides for known epitopes, 
DTH responses are easily determined in vaccinated vertebrates using 
these reagents, thus providing an additional in vivo correlate for 
inducing cellular immunity. 

Protection 

Based upon the above immunologic studies, it is predictable 
that our vaccines are effective in vertebrates against c hallenge by 
virulent HIV. These studies are accomplished in an 
HIVniB/chimpanzee challenge model after sufficient vaccination of 
these animals with a PNV construct, or a cocktail of PNV constructs 
comprised of gpl60mB. *a#IIIB, nefniB and tf£V'HIB. The IHB 
strain is useful in this regard as the chimpanzee titer of lethal doses of 
this strain has been established. However, the same studies are 
envisioned using any strain of HIV and the epitopes specific to or 
heterologous to the given strain. A second vaccination/challenge model, 
in addition to chimpanzees, is the scid-hu PBL mouse. This model 
allows testing of the human lymphocyte immune system and our vaccine 
with subsequent HIV challenge in a mouse host. This system is 
advantageous as it is easily adapted to use with any HIV strain and it 
provides evidence of protection against multiple strains of primary field 
isolates of HIV, A third challenge model utilizes hybrid HIV/SIV 
viruses (SHIV), some of which have been shown to infect rhesus 
monkeys and lead to immunodeficiency disease resulting in death [see 
Li, J., et al., LAfl2£ £639-646, 1992]. Vaccination of rhesus with our 
polynucleotide vaccine constructs is protective against subsequent 
challenge with lethal doses of SHIV. 



PNV Construct Summary 
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HIV and other genes are preferably ligated into an 
expression vector which has been specifically optimized for 
polynucleotide vaccinations. According to this invention disclosure 
methods for producing several such vectors are enabled. Essential; .ill 
extraneous DNA is removed, leaving the essential elements of 
transcriptional promoter, immunogenic epitopes, and additional cis-ns 
encoding immunoenhancing or immunomodulatory genes, with the^ 
own promoters or IRES, transcriptional terminator, bacterial origu : 
replication and antibiotic resistance gene, as previously described <sr 
figure 2). Those skilled in the an will appreciate that introduction 
RNA which has been transcribed in vitro to produce the multi-cistrx: 
mRNAs encoded by the DNA counterparts of this invention natural 
forms an integral part of this invention. For this purpose, it is desircie 
to use as the transcriptional promoter such powerful RNA polymers 
promoters as the T7 or SP6 promoters, and performing run-on 
transcription with a linearized DNA template. These methods are w-.i 
known in the art. 

Expression of HIV late genes such as env and gag is IC - 
dependent and requires that the REV response element (RRE) be pr-.^nt 
on the viral gene transcript. A secreted form of gpI20 can be gene::ed 
in the absence of REV by substitution of the gpl20 leader peptide w:n 
a heterologous leader such as from tPA (tissue-type plasminogen 
activator), and preferably by a leader peptide such as is found in hir_v 
expressed mammalian proteins such as immunoglobulin leader peptic. 
We have inserted a tPA-gpl20 chimeric gene into VUns which 
efficiently expresses secreted gpl20 in transfected cells (RD, a hum_ 
rhabdomyosarcoma line). We have also developed an IRES-based 
(IRES = internal ribosomal entry site) dicistronic VUns vector 
containing both gpl60 (which harbors the RRE) and REV which 
efficiently expresses gp!60 in transfected cell lines (293, a human 
embryonic kidney cell line; and RD). Monocistronic gpl60 does n~ 
produce any protein upon transfection without the addition of a RE 
expression vector. Dicistronic gpl 60/REV produces similar amour 
of gpl60 as co-transfected gp!60 and REV monocistronic vectors 
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From these studies, it is predictable that dicistronic vectors more 
efficiently express gpl60 following introduction in vivo intramuscularly 
relative to a mixture of gpI60 and REV vectors because the dicistron 
insures the proximity of gpl60 construct and REV within structurally 
extended, multi-nucleated muscle cells. This dicistronic strategy also 
supports expression of gag after the inclusion of the RRE within the 
transcript region of the vector. It also supports the expression of 
unrelated genes in a bi- or tri-cistronic PNV, such as co-expression of 
HIV immunogenic epitopes, influenza virus immunogenic epitopes, 
cancer-related antigens, and immunomodulatory genes such as 
interleukin, B7 and GM-CSF. 

Representative Constr uct Components Include (but are not restricted to) 
(see figure 2. Cistrons I, II. and III): 

1. tPA-gpl20MN; 

2. gpl60niB/IRES//?£ViiTB; 

3. gpl60mB; 

4. K£VinB; 

5. tat/REV/gp\60 (a genomic IIIB clone which weakly 

expresses gpJ60); 

6. REV/gpi60; 

7. gpi60MN; 

8. gpl60 from clinically relevant primary HIV isolates; 

9. nef, using the gene from clinically relevant strains; 

10. gaguiB- foramina;? CTL; 

11. tPA-gpI20lHB: for chimp studies; 

12. gpl60 with structural mutations: V3 loop substitutions 
from clinically relevant strains of HIV; several mutations 
on several constructs such as variable loop removal, Asn 
mutations to remove stenc carbohydrate obstacles to 
structural, neutralizing antibody epitopes; and CD4 
binding site knockout mutants; 

13. gp41 : to specifically elicit anti-gp41 neutralizing 
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antibodies, particularly the 2F5 monoclonal antibody 
epitope, located directly anterior to the transmembrane 
domain, which is broadly conserved across many strains. 
This peptide is difficult to express in the absence of gp 1 20 
and requires several strategies, e.g.. a recent report found 
that the IF 5 epitope spliced into an influenza HA loop tip 
could elicit HIV neutralizing antibodies; alternatively, 
provision of appropriate leader sequences, as in the tPA 
signal peptide leader sequence, allows expression of this 
gene product; 

14. gag: similar to construct from #5 above, using the 
gene from clinically relevant strains; 

15. rev: for gpl60 and dicistromcs; 

16. B7 coding sequences; 

17. GM-CSF sequences; 

18. Interleukin sequences, particularly encoding IL-12; 

19. Tumor associated antigens; 

20. Genes encoding antigens expressed by pathogens other 
than HIV, such as, but not limited to, influenza virus 
nucleoprotein, hemagglutinin, matrix, neuraminidase, and 
other antigenic proteins; herpes simplex virus genes; human 
papillomavirus genes; tuberculosis antigens; hepatitis A, B. 
or C virus antigens; and combinations of these and other 
antigens to form at least dicistronic constructs which may 
be combined with multiple other polycistronic constructs to 
provide a cocktail composition capable of raising immune 
responses against all of the represented pathogens or 
tumor antigens. 

In the HIV env constructs, those of ordinary skill in the art 
will recognize the desirability of expressing nucleic acids encoding 
various env V3 loop amino acid sequences. As an example, any or all 
of the following amino acid sequences, or portions thereof, may be 
encoded by HIV polynucleotide immunogens of this invention: 
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GP160 V3 LOOP SFOT F NCK SI'MMARY FOR PNV 
( ONSTRl CTS 

Nonh American/European Consensus. SEQ.ID.l; 

CysThrArgProAsnAsnAsnThrArgLysSerlleHisIleGlyProGIyArgAla 
PheTyrThrThrGlyGluIIelleGlyAspIleArgGInAlaHisCys 

MN, SEQ.ID:2: 

CysThrArgProAsnTyrAsnLysArgLysArglleHisIleGIyProGlyArgAIa 
PheTyrThrThrLysAsnllelleGlyThrlleArgGlnAlaHisCys 

IIIB (HXB2R), SEQ.ID:3: 

CysThrArgProAsnAsnAsnThrArgLysArglleArglleGInArgGlyProGly 
ArgAlaPheValThrlleGlyLyslleGlyAsnMetArgGlnAlaHisCys 

1 16-v, SEQ.ID:4: 

CysThrArgProAsnAsnAsnThrArgLysGIylleHisIleGIyProGlyArgAla 
PheTyrThrThrGlyLysIlelleGlyAsnlleArgGinAlaHisCys 

452-p, SEQ.ID:5: 

CysThrArgProSerAsnAsnAsnThrArgLysSerlleHisIleGlyProGlyLys 
AlaPheTyrAlaThrGlyAlallelleGlyAspIleArgGInAlaHisCys 

146-v, SEQ.ID:6: 

CysThrArgProAsnAsnAsnThrArgArgSerlleHisIleAlaProGlyArgAla 
PheTyrAlaThrGlyAspIlelleGlyAspUeArgGInAlaHisCys 

The protective efficacy of polynucleotide HIV immunogens 
against subsequent viral challenge is demonstrated by immunization with 
the non-replicating plasmid DNA of this invention. This is 
advantageous since no infectious agent is involved, no assembly of virus 
particles is required, and determinant selection is permitted. 
Furthermore, because the sequence of gag and protease and several of 
the other viral gene products is conserved among various strains of 
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HIV. protection against subsequent challenge by a virulent strain of HIV 
that is homologous to, as well as strains heterologous to the strain from 
which the cloned gene is obtained, is enabled. 

The l.m. injection of a DNA expression vector encoding 
gp 1 60 results in the generation of significant protective immunity 
against subsequent viral challenge. In particular, gp 1 60-specific 
antibodies and primary CTLs are produced. Immune responses directed 
against conserved proteins can be effective despite the antigenic shift 
and drift of the variable envelope proteins. Because each of the HIV 
gene products exhibit some degree of conservation, and because CTLs 
are generated in response to intracellular expression and MHC 
processing, it is predictable that many virus genes give rise to responses 
analogous to that achieved for gp!60. Thus, many of these genes have 
been cloned, as shown by the cloned and sequenced junctions in the 
expression vector (see below) such that these constructs are 
immunogenic agents in available form. 

The invention offers a means to induce cross-strain 
protective immunity without the need for self-replicating agents or 
adjuvants. In addition, immunization with the instant polynucleotides 
offers a number of other advantages. First, this approach to vaccination 
should be applicable to tumors as well as infectious agents, since the 
CD8+ CTL response is important for both pathophysiological processes 
[K. Tanaka et ai, Annu. Rev. Immunol. 6, 359 (1988)]. Therefore, 
eliciting an immune response against a protein crucial to the 
transformation process may be an effective means of cancer protection 
or immunotherapy. Second, the generation of high titer antibodies 
against expressed proteins after injection of viral protein and human 
growth hormone DNA, [see for example D.-c. Tang et ai. Nature 356, 
152, 1992], indicates this is a facile and highly effective means of 
making antibody-based vaccines, either separately or in combination 
with cytotoxic T-lymphocyte vaccines targeted towards conserved 
antigens. 

The ease of producing and purifying DNA constructs 
compares favorably with traditional protein purification, facilitating the 
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generation of combination vaccines. Thus, multiple constructs, for 
example encoding gpl60, gpl20, gp4J, or any other HIV gene may be 
prepared, mixed and co-administered. Finally, becaus? protein 
expression is maintained following DNA injection [H. Lin etai. 
Circulation 82. 2217 (1990); R.N. Kitsis et al. Proc. Natl. Acad. Sci. 
(USA) 88, 4138 (1991); E. Hansen et al.. FEBS Lett 290, 73 (1991); 
S. Jiao et al.. Hum. Gene Therapy 3, 21 (1992); J. A. Wolff et al .. 
Human Mol. Genet. 1. 363 (1992)], the persistence of B- and T-celi 
memory may be enhanced [D. Gray and P. Matzinger. J. Exp. Med. 
174, 969 (1991); S. Oehen et al. ibid. 176, 1273 (1992)], thereby 
engendering long-lived humoral and cell-mediated immunity. 

The standard techniques of molecular biology for 
preparing and purifying DNA constructs enable the preparation of the 
DNA immunogens of this invention. While standard techniques of 
molecular biology are therefore sufficient for the production of the 
products of this invention, the specific constructs disclosed herein 
provide polynucleotide immunogens which surprisingly produce cross- 
strain and primary HIV isolate neutralization, a result heretofore 
unattainable with standard inactivated whole virus or subunit protein 
vaccines. 

The amount of expressible DNA or transcribed RNA to be 
introduced into a vaccine recipient will depend on the strength of the 
transcriptional and translational promoters used and on the 
immunogenicity of the expressed gene product. In general, an 
immunologically or prophyiactically effective dose of about 1 ng to 100 
mg, and preferably about 10 (ig to 300 jig is administered directly into 
muscle tissue. Subcutaneous injection, intradermal introduction, 
impression through the skin, and other modes of administration such as 
intraperitoneal, intravenous, or inhalation delivery are also 
contemplated. It is also contemplated that booster vaccinations are to be 
provided. Following vaccination with HTV polynucleotide immunogen, 
boosting with HIV protein immunogens such as gpl60, gpl20, and gag 
gene products is also contemplated. Parenteral administration, such as 
intravenous, intramuscular, subcutaneous or other means of 
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administration of interleukin- 1 2 protein, concurrent!} with or 
subsequent to parenteral introduction of the PNV of this invention a: / 
advantageous. 

The polynucleotide may be naked, that is, unassociated v;rh 
any proteins, adjuvants or other agents which impact on the recipient 
immune system. In this case, it is desirable for the polynucleotide t; ?e 
in a physiologically acceptable solution, such as, but not limited to. 
sterile saline or sterile buffered saline. Alternatively, the DNA ma> re 
associated with liposomes, such as lecithin liposomes or other liposomes 
known in the art, as a DNA-iiposome mixture, or the DNA may be 
associated with an adjuvant known in the art to boost immune respon-es. 
such as a protein or other carrier. Agents which assist in the cellule- 
uptake of DNA, such as, but not limited to, calcium ions, may also h-. 
used to advantage. These agents are generally referred to herein as 
transfection facilitating reagents and pharmaceutical^ acceptable 
carriers. Techniques for coating microprojectiles coated with 
polynucleotide are known in the art and are also useful in connectior 
with this invention. 

Accordingly, one embodiment of this invention is a 
polynucleotide which, upon introduction into mammalian tissue, indn.es 
the co-expression in a single cell, in vivo , of two or three different, 
discrete gene products, comprising; 

a first transcriptional promoter which operates efficiently in eukary.ric 
cells upstream from and in transcriptional control of a first cistron: 
a second cistron downstream from the first cistron, under 
transcriptional control either of the first transcriptional promoter, ■ 
under control of a second transcriptional promoter; 
optionally, a third cistron downstream from the second cistron, unaz' 
transcriptional control either of the first transcriptional promoter, 
under control of a second transcriptional promoter, or under conm jf 
a third transcriptional promoter; 

a transcriptional terminator following each of the first, second and ~^ird 
cistron. unless followed by another citron lacking its own 
transcriptional promoter. 
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In another embodiment, the invention is a polynucleotide 
which comprises contiguous nucleic acid sequences which cannot 
replicate in eukaryotic cells but which are capable of being expressed to 
produce a gene product upon introduction of the polynucleotide into 
eukaryotic tissues in vivo . The encoded gene product preferably either 
acts as an immunostimulant or as an antigen capable of generating an 
immune response. Thus, the nucleic acid sequences in this embodiment 
encode a spliced REV gene, a human immunodeficiency virus (HIV) 
immunogenic epitope, and optionally, a cytokine or a T-cell 
costimulatory element, such as a member of the B7 family of proteins. 

In another embodiment, the invention is a method for co- 
expression in a single cell, in vivo , of two or three different, discrete 
gene products, which comprises introducing between about 0.1 (ig and 
100 mg of a polynucleotide of this invention into the tissue of the 
vertebrate. 

In another embodiment, the invention is a method for using 
a REV dependent HIV gene to induce immune responses in vivo which 
comprises: 

a) isolating the REV dependent HIV gene; 

b) linking the isolated gene to regulatory sequences such 
that the gene is expressible by virtue of being operatively linked to 
control sequences which, when introduced into a living tissue, direct the 
transcription initiation and subsequent translation of the gene; 

c) introducing the expressible gene into a living tissue; 

d) introducing a gene encoding HIV REV either in trans or 
in cis to the HIV REV dependent gene; and 

e) optionally, boosting with additional expressible HIV 

gene. 

A further embodiment of this invention amounts to a 
method of inducing an antigen presenting cell to stimulate cytotoxic T- 
cell proliferation specific to HIV antigens. This involves exposing cells 
of a vertebrate in vivo to a polynucleotide which consists of an antigenic 
HIV epitope, REV if the antigenic HIV epitope depends on REV for 
efficient expression, and B7 encoding sequences. 
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The following examples are provided to further define the 
invention, without limiting the invention to the specifics of the 
examples. 

5 

Materials descriptions 

Vectors pF41 1 and pF412: These vectors were subcloned 
from vector pSP62 which was constructed in R. Gallo's lab. pSP62 is 
an available reagent from Biotech Research Laboratories, Inc. pSP62 
has a 12.5 kb Xbal fragment of the HXB2 genome subcloned from 
lambda HXB2. Sail and Xba I digestion of pSP62 yields to HXB2 
fragments: 5 , -XbaI/SalI, 6.5 kb and 3'- Sall/Xbal, 6 kb. These inserts 
were subcloned into pUC IS at Smal and Sail sites yielding pF41 1 (5'- 
Xbal/Sall) and pF41 2 (3'-XbaI/SaII). pF41 1 contains gag/pol and 
pF4 1 2 contains tat/rev/env/nef . 

Repligen reagents: 
recombinant rev (EIB), #RP1024-I0 
rec. gpl20 (niB), #RP1001-10 
2o anti-rev monoclonal antibody, #RPI029-10 
anti-gpl20 mAB, #1C1 , #RP1010-10 

AIDS Research and Reference Reagent Program: 
anti-gp41 mAB hybridoma, Chessie 8, #526 

25 EXAMPLE 1 

VECTORS FOR VACCINE PRODUCTION 

A) VI: The expression vector VI was constructed from pCMVIE-AKI- 
DHFR [Y. Whang ct at., J. Virol. 61, 1796 (1987)]. The AKI and 
3o DHFR genes were removed by cutting the vector with EcoR I and self- 
ligating. This vector does not contain intron A in the CMV promoter, 
so it was added as a PCR fragment that had a deleted internal Sac I site 
[at 1855 as numbered in B.S. Chapman et aL, Nuc. Acids Res. 19, 3979 
(1991)]. The template used for the PCR reactions was pCMVintA-Lux. 
made by ligating the Hind III and Nhe I fragment from pCMV6al20 
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[see B.S. Chapman et at., ibid.,] which includes hCMV-IEl 
enhancer/promoter and intron A, into the Hind III and Xba I sites of 
pBL3 to generate pCMVIntBL. The 1881 base pair luciferase gene 
fragment (Hind III-Sma I Klenow filled -in) from RSV-Lux [J.R. de Wet 
et al., Mol. Cell Biol. 7. 725, 1987] was cloned into the Sal I site of 
pCMVIntBL, which was Klenow filled-in and phosphatase treated. 

The primers that spanned intron A are: 
5" primer, SEQ. ID:7: 

.V-CTATATAAGCAGAG CTCGTTTAG-3 ; The 3' primer, SEQ ID:8: 
5 -GTAGCAAAGATCTAAGGACGGTGA CTGCAG-3'. 

The primers used to remove the Sac 1 site are: 
sense primer, SEQ ID:9: 

5-GTATGTGTCTGAAAATGAGCGTGGAGATTGGGCTCGCAC-3' 
and the antisense primer, SEQ ID: 10:, 

S'-GTGCGAGCCCAATCTCCACGCTCATTTTCAGACACA TAC-3'. 

The PCR fragment was cut with Sac I and Bgl II and 
inserted into the vector which had been cut with the same enzymes. 

B) VI J EXPRESSION VECTOR. SEQ. ID: 12: 

Our purpose in creating V1J was to remove the promoter 
and transcription termination elements from our vector, VI, in order to 
place them within a more defined context, create a more compact 
vector, and to improve plasmid purification yields. 

VI J is derived from vectors VI, (see Example 1) and 
pUC18, a commercially available plasmid. VI was digested with Sspl 
and EcoRI restriction enzymes producing two fragments of DNA. The 
smaller of these fFagments, containing the CMVintA promoter and 
Bovine Growth Hormone (BGH) transcription termination elements 
which control the expression of heterologous genes (SEQ ID: 13:). was 
purified from an agarose electrophoresis gel. The ends of this DNA 
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fragment were then "blunted" using the T4 DNA polymerase enzyme in 
order to facilitate us ligation to another "blunt-ended" DNA fragment. 

pUCIS was chosen to provide the "backbone" of the 
expression vector. It is known to produce high yields of plasmid. is 
well-characterized by sequence and function, and is of minimum size. 
We removed the entire lac operon from this vector, which was 
unnecessary for our purposes and may be detrimental to plasmid yields 
and heterologous gene expression, by partial digestion with the Haell 
restriction enzyme. The remaining plasmid was purified from an 
agarose electrophoresis gel, blunt-ended with the T4 DNA polymerase , 
treated with calf intestinal alkaline phosphatase, and ligated to the 
CMVintA/BGH element described above. Plasmids exhibiting either of 
two possible orientations of the promoter elements within the pUC 
backbone were obtained. One of these plasmids gave much higher 
yields of DNA in E. coli and was designated VI J (SEQ. ID:12:). This 
vector's structure was verified by sequence analysis of the junction 
regions and was subsequently demonstrated to give comparable or 
higher expression of heterologous genes compared with VI. 

Q VUneo EXPRESSION VECTOR. SFO ID- 14 - 

It was necessary to remove the amp r gene used for 
antibiotic selection of bacteria harboring VIJ because ampicillin may 
not be used in large-scale fermenters. The amp r gene from the pUC 
backbone of VIJ was removed by digestion with Sspl and Eaml 1051 
restriction enzymes. The remaining plasmid was purified by agarose 
gel electrophoresis, blunt-ended with T4 DNA polymerase, and then 
treated with calf intestinal alkaline phosphatase. The commercially 
available kan r gene, derived from transposon 903 and contained within 
the pUC4K plasmid, was excised using the PstI restriction enzyme, 
purified by agarose gel electrophoresis, and blunt-ended with T4 DNA 
polymerase. This fragment was ligated with the VI J backbone and 
plasmids with the kan r gene in either orientation were derived which 
were designated as VUneo #'s 1 and 3. Each of these plasmids was 
confirmed by restriction enzyme digestion analysis. DNA sequencing of 
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the junction regions, and was shown to produce simitar quantities of 
plasmid as VI J. Expression of heterologous gene products was also 
comparable to VIJ for these VlJneo vectors. We arbitrarily selected 
VUneo#3, referred to as VlJneo hereafter (SEQ. ID:14:), which 
contains the kan r gene in the same orientation as the amp r gene in VIJ 
as the expression construct. 

D) VUns EXPRESSION VECTOR: 

An Sfi I site was added to VlJneo to facilitate integration 
studies. A commercially available 13 base pair Sfi I linker (New 
England BioLabs) was added at the Kpn I site within the BGH sequence 
of the vector. VlJneo was linearized with Kpn I, gel purified, blunted 
by T4 DNA polymerase, and ligated to the blunt Sfi I linker. Clonal 
isolates were chosen by restriction mapping and verified by sequencing 
through the linker. The new vector was designated VUns. Expression 
of heterologous genes in VUns (with Sfi I) was comparable to 
expression of the same genes in VlJneo (with Kpn I). 

E) pGEM-3-IRES: The encephalomyocarditis virus (EMCV) internal 
ribosomal entry site (IRES) allows efficient expression of two genes 
within a single mRNA transcript when it is juxtaposed between them. 
We have utilized this non-coding gene segment to create dicistronic 
expression vectors for polynucleotide vaccines. The EMCV IRES 
segment was subcloned as a 0.6 kb EcoRl/BssHII digestion fragment 
from the pCITE-1 plasmid (Novagen). This fragment was agarose gel- 
purified, blunt-ended using T4 DNA polymerase and subsequently 
ligated into pGEM-3 (Promega) which had been Xbal-digested, blunt- 
ended with T4 DNA polymerase, and phosphatased. Clones were 
obtained for each of the two possible orientations of this DNA within 
pGEM-3 and each junction site verified by DNA sequencing. The 
preferred orientation for subsequent construction of dicistronic vectors 
positioned the Ncol site within the IRES proximal to BamHI site within 
pGEM-3. This vector is referred to as pGEM-3-IRES. 
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F) pGEM-3-IRES* : A second IRES vector was prepared containing 
mutations in the IRES sequence (IRES*) conferred by a PCR oligomer 
which may optimize IRES-driven expression compared to wild type 
IRES. PCR amplification of IRES* was performed using pCITE-1 
plasmid (Novagen) with the following sense and antisense oligomers: 
5 -GGT ACA AGA TCT ACT ATA GGG AGA CCG GAA TTC CGC- 
3', SEQ- ID: 1 1 :, and 5'-CCA CAT AGA TCT GTT CCA TGG TTG 
TGG CAA TAT TAT CAT CG-3\ SEQ. ID:15:, respectively. The 
mutated residue, underlined in the antisense codon. eliminates an 
upstream ATG from the preferred ATG contained within the 
Ncol/Kozak sequence at the 3-terminal end of the IRES 

G) pGEM-3-IRES/flEV : HIVn] D REV was PCR amplified from 
pCV-1 (catalogue #303, NIH AIDS Research and Reference Program) 
using synthetic oligomers. The sense and antisense oligomers were 5 - 
GGT ACA AGA TCT ACCATCGCA GGA AGA AGC GG A GAC 
AGC-3, SEQ. ID: 16:, and 5 -CCA CAT AGA TCT GAT ATC GCA 
CTA TTC TTT AGC TCC TGA CTC C-3\ SEQ. ID: 17:, respectively. 
These oligomers provide Bglll sites at either end of the translation open 
reading frame as well as an EcoRV site directly upstream from the 
Bglll site at the 3 -terminal end of rev. After PCR. the REV gene was 
treated with Ncol (located within the Kozak sequence) and Bglll 
restriction enzymes and Iigated with pGEM-3-IRES which had been 
treated with Ncol and BamHI restriction enzymes. Each ligation 
junction as well as the entire 0.3 kb REV gene was confirmed by DNA 
sequencing. 

H) VI Jns-tPA : In order to provide an heterologous leader peptide 
sequence to secreted and/or membrane proteins, VUn was modified to 
include the human tissue-specific plasminogen activator (tPA) leader. 
Two synthetic complementary oligomers were annealed and then Iigated 
into VUn which had been Bglll digested. The sense and antisense 
oligomers were 5 -GATC ACC ATG G AT GCA ATG AAG AGA GGG 
CTC TGC TGT GTG CTG CTG CTG TGT GGA GCA GTC TTC GTT 
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TCG CCC AGC GA-.V. SEQ.ID: IS:, and 5 -GAT CTC GCT GGG ; J A 
AAC GAA GAC TGC TCC ACA CAG CAG CAG CAC ACA GC, 
GAG CCC TCT CTT CAT TGC ATC CAT GGT-3', SEQ. ID: 19:. Tie 
Kozak sequence is underlined in the sense oligomer. These oligomer 
have overhanging bases compatible for ligation to Bglll-cleaved 
sequences. After ligation the upstream BgiU site is destroyed while ne 
downstream Bglll is retained for subsequent ligations. Both the 
junction sites as well as the entire tPA leader sequence were verifiec ?y 
DNA sequencing. Additionally, in order to conform with our cons-isus 
optimized vector VUns (=VUneo with an Sfil site), an Sfil restnctun 
site was placed at the Kpnl site within the BGH terminator region o 
VUn-tPA by blunting the Kpnl site with T4 DNA polymerase folloved 
by ligation with an Sfil linker (catalogue #1 138, New England Biok?s). 
This modification was verified by restriction digestion and agarose ^1 
electrophoresis. 

I) VUns-HTVinb REV • REV was amplified by PCR as describee 
above for pGEM-3-IRES/KEV, digested with Bglll restriction enzyne. 
and ligated into VUns which had been Bglll- and calf intestinal alkaine 
phosphatase-treated. Ligation junctions were confirmed by DNA 
sequencing and expression of REV was verified by in vitro transfeciun 
of RD cells and immunoblot analysis (greater than 1 ug REV obtain^ 
per 10 6 cells). 

J) pGEM-3-RRE/IRES/REV: In order to make a cassette consisting 
the REV response element (RRE) which is required to be on an R]\ - 
transcript in order for REV -dependent expression to occur, the RRI 
from HIV strain HXB2 was obtained by PCR using the following 
synthetic oligomers: sense oligomer, 5'-GGT ACA TGA TCA GA" 
ATC GCCC GGG C CGA GAT CTT CAG ACT TGG AGG AGG 
3', SEQ.ID:20:; and antisense oligomer, 5-CCA CAT TGA TCA C 
CTT GTG TAA TTG TTA ATT TCT CTG TCC-3\ SEQ.ID:21:. 
These oligomers provide Bell restriction sites at either end of the in ert 
as well as EcoRV and Srfl sites at the 5'-end of the insert. The RRZ 
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was blunt-end ligated into pGEM-3-/IRES/REV at the Hindi restriction 
.site which precedes IRES. The ligation products were verified by 
restriction enzyme mapping and by DNA sequencing across the ligation 
junctions. 

EXAMPLE 2 

gpl 20 Vaccines: 

Expression of the REV -dependent env gene as gpl 20 was 
conducted as follows: gpl 20 was PCR-cloned from the MN strain of 
HIV with either the native leader peptide sequence (VI Jns-gpl20), or as 
a fusion with the tissue-plasminogen activator (tPA) leader peptide 
replacing the native leader peptide (VUns-tPA-gpI20). tPA-gpl20 
expression has been shown to be /?£V'-independent [B.S. Chapman et al., 
Nuc. Acids Res. 19, 3979 (1991); it should be noted that other leader 
sequences would provide a similar function in rendering the gpl 20 gene 
REV independent]. This was accomplished by preparing the following 
gpl 20 constructs utilizing the above described vectors: 



1. cp!20 VACCINE CONSTRUCTS: 

A) VlJns-tPA-HTVM N gp!20 : HIVMN gp 120 gene (Medimmune) 
was PCR amplified using oligomers designed to remove the first 30 
amino acids of the peptide leader sequence and to facilitate cloning into 
VUns-tPA creating a chimeric protein consisting of the tPA leader 
peptide followed by the remaining gpl 20 sequence following amino acid 
residue 30. This design allows for REV -independent gpl 20 expression 
and secretion of soluble gpl 20 from cells harboring this plasmid. The 
sense and antisense PCR oligomers used were 5'-CCC CGG ATC CTG 
ATC ACA GAA AAA TTG TGGGTC ACA GTC-3\ SEQ. ID:22:, and 
5-C CCC AGG A ATC CAC CTG TTA GCG CTT TTC TCT CTG 
CAC CAC TCT TCT C-3 SEQ- ID:23:. The translation stop codon is 
underlined. These oligomers contain BamHI restriction enzyme sites at 
either end of the translation open reading frame with a Bell site located 
3' to the BamHI of the sense oligomer. The PCR product was 
sequentially digested with Bell followed by BamHI and ligated into 
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VUns-tPA which had been BgUI digested followed by calf intestinal 
alkaline phosphatase treatment. The resulting vector was sequenced to 
confirm inframe fusion between the tPA leader and gp!20 coding 
sequence, and gp 1 20 expression and secretion was verified by 
immunoblot analysis of transfected RB ceils. Thus, this vector encoding 
the tPA-HIVMN-gpl20 is useful for inclusion in a bi- or tri cistronic 
construct expressing gag, B7 or other antigens. 

B) Vl-tPA-HIVM N gp!20 : A slightly different version of the chimeric 
tPA-HIViynsj gp!20 vector described above was made using an earlier 
version of our basic vaccine expression vector. VI (see Nucleic Acid 
Pharmaceuticals patent), which contained a somewhat different tPA 
peptide leader sequence from that described for VUns-tPA. 

In either of the foregoing PNV constructs, provision of an IRES 
sequence after the translation stop codon, and downstream cloning of 
immunomodulatory genes such as B7, provides bi- or tri-cistronic 
polynucleotides useful according to the method of this invention. These 
PNV's efficiently express both gene products. 

C) VI Jns-tPA-HIVr rm gp!20 : This vector is analogous to I. A. except 
that the HIV IIIB strain was used for gpl20 sequence. The sense and 
antisense PCR oligomers used were: 5-GGT ACA TGA TCA CA GAA 
AAA TTG TGG GTC ACA GTC-3\ SEQ.ID:24:, and 5-CCA CAT 
TGA TCA GAT ATC TTA TCT TTT TTC TCT CTG CAC CAC TCT 
TC-3\ SEQ.ID:25: t respectively. These oligomers provide Bell sites at 
either end of the insert as well as an EcoRV just upstream of the Bell 
site at the 3 -end. The ^'-terminal Bell site allows ligation into the 
BglD site of VUns-tPA to create a chimeric tPA-gpl20 gene encoding 
the tPA leader sequence and gpl20 without its native leader sequence. 
Ligation products were verified by restriction digestion and DNA 
sequencing. 
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II. IN VITRO gpl20 VACCINE EXPRESSION- 

Jn vitro expression was tested in transfected human rhabdomyosarcoma 
(RD) cells for these constructs. Quantitation of secreted tPA-epl20 
from transfected RD cells showed that V 1 Jns-tPA-gpl 20 vector 
produced secreted gp!20. 

III. IN VIVO gp!20 VACCINATION: 
See figure 12 (mouse data): 



Anti-gpI20 ELISA Titers Flic iter! 
by Secreted gp 120 * 



15 



Species 



GMT (range) 



mouse 
(post 2 rounds, 200u,g 

per round) 



5,310 (1.8 x 103. 1.5 x I0 4 ) 



20 



rabbit 

(post 3 rounds, 2 mg 
per round) 



143 (75- 212) 



25 



A.G. monkey 
(post 2 rounds, 
2 mg per round) 



171 (< 10-420) 



30 



"Using VUns-tPArgpl20niB as the inoculation vector, intramuscularly. 
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VUns-tPA-gpl20 MNj PNV-induced CIhss II MHC- 
restncted T lymphocy te gp!20 specific antigen reactivities Balb/c mice 
which had been vaccinated two times with 200 |!g VIJns-tPA-gpl 20MN 
were sacrificed and their spleens extracted for in vitro determinations 
of helper T lymphocyte reactivities to recombinant gpi20. T cell 
proliferation assays were performed with PBMC (peripheral blood 
mononuclear cells) using recombinant gpl20lHB (Repligen, catalogue 
#RP1016-20) at 5 jig/ml with 4 x 105 cells/ml. Basal levels of ^H- 
thymidine uptake by these cells were obtained by culturing the cells in 
media alone, while maximum proliferation was induced using ConA 
stimulation at 2 jig/ml. ConA-induced reactivities peak at ~3 days and 
were harvested at that time point with media control samples while 
antigen-treated samples were harvested at 5 days with an additional 
media control. Vaccinated mice responses were compared with naive, 
age-matched syngenic mice. ConA positive controls gave very high 
proliferation for both naive and immunized mice as expected. Very 
strong helper T cell memory responses were obtained by gpl20 
treatment in vaccinated mice while the naive mice did not respond (the 
threshold for specific reactivity is an stimulation index (SI) of >3-4; SI 
is calculated as the ratio of sample cpm/media cpm). Sis of 65 and 14 
were obtained for the vaccinated mice which compares with anti-gpl20 
ELISA titers of 5643 and 1 1 ,900, respectively, for these mice. 
Interestingly, for these two mice the higher responder for antibody gave 
significantly lower T cell reactivity than the mouse having the lower 
antibody titer. This experiment demonstrates that the secreted gpl20 
vector efficiently activates helper T cells in vivo as well as generates 
strong antibody responses, in addition, each of these immune responses 
was determined using antigen which was heterologous compared to that 
encoded by the inoculation PNV (UIB vs. MN): 
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Splenic T Cell Proliferation Responses to rgp 1 20 Following 
Vaccination with V1Jns-tPA-gp1 20m N 

Avg. CPM (Stimulation Index) 

5 

Mouse #(agp120 titer} 3 Media 1 ConA 1 Media 2 

rgpl 202 

#1 (naive; <10) 339 (1) 185,358 (546) 187 (1) 574 (3) 

10 

#2 (naive; <10) 237 (1) 229,775 (969) 283 (1) 51 1 
(1-8) 

#3 (immune; 5643) 317 (1) 221,003 (697) 354 (1) 
23,109 (65) 

15 

#4 (immune; 1 1,900) 229 243,427 (1063) 235 (1) 

3384 (14) 

1 Cells harvested on day 4 following 24 hr with 3H-thymidine. 
ConA was used at 2 u,g/ml concentration. 

20 

2 Cells harvested on day 5 following 24 hr with 3 H-thymidine . 
Recombinant gp120|||B was used at 5 (ig/ml concentration. 



3 Anti-gp1 20j||B reciprocal endpoint ELISA titers and 
proliferation assays performed following 2 rounds of 200 jig 
DNA/mouse (Ba!b/c). 

The foregoing data clearly demonstrates efficient in vivo 
expression of relevant HIV antigens with a polynucleotide vaccine 
antigen and elicitation of specific immune responses to the expressed 
gene product. This construct is easily modified to form a bi-cistronic 
PNV of this invention by including, downstream from the gp!20 
translation stop codon, an second or third cistron encoding REV, B7, 
gag or other antigens unrelated to HIV. such as influenza nucleoprotein 
or hemagglutinin encoding genes. 
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EXAMPLE 3 

gp!60 VACCINES 

In addition to secreted gp 1 20 constructs, we have prepared 
expression constructs for full-length, membrane-bound gp 1 60. The 
rationales for a gp 1 60 construct, in addition to gp 1 20, are (1) more 
epitopes are available both for both CTL stimulation as well as 
neutralizing antibody production including gp4 1 , against which a potent 
HIV neutralizing monoclonal antibody (2F5, see above) is directed; (2) 
a more native protein structure may be obtained relative to virus- 
produced gpl60; and, (3) the success of membrane-bound influenza HA 
constructs for immunogenicity [Ulmeret aL Science 259 : 1745-1749, 
1993; Montgomery. P.. et at.. DNA and Cell Biol.. 12:777-783. 1993). 

gpl60 retains substantial REV dependence even with a 
heterologous leader peptide sequence. Therefore, two strategies 
independent from that employed for gpI20 expression were developed 
for preparing a gpl60 expression vector: (1 ) subcloning into VI Jns a 
genomic HIV DNA fragment reported to be effective for heterologous 
gpl60 expression containing tat. REV and gpl60 in entirety (VUns- 
tat/REV/env ), [Wang et al., P.N.A.S. USA 90:4156-4160 (May, 1993); 
all of the data reported in that study were generated using bupivacaine 
injection about 24 hours prior to nucleic acid injection. As bupivicaine 
is known to cause muscle damage, this is a regiment that clearly could 
not be used to immunize humans], and (2) PCR-cloning a minimal 
gpl60 ORF into a dicistronic vector before the EMCV internal 
ribosomal entry site (IRES) to efficiently reinitiate translation following 
gpl60 translation for a second cistron encoding REV. This construct 
ensures effective simultaneous production of both gpl60 and REV 
proteins (VI Jns-gpl60/TRES/rcv). Each of these vectors has been 
prepared in addition to the monocistronic vectors VUns-gpl60 and 
VUns-KEV. Because there is evidence, in the literature and from our 
own experiments (see below), that the env mRNA requires the tat/REV 
splice donor (SD) site for stability in heterologous expression systems, 
VUns-gpl60 and VI Jns-gpl60/IRES//?£l/ were also prepared with this 
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SD inserted upstream of the enr ORF. These vaccine constructs were 
prepared as follows. 

T p pi60 VACCINE CONSTRUCTS: 

Both gpl60 expression vectors, VIJns-gpl60 and VUns- 
cpl60/IRES/rev (see A and B below > were prepared with the tat/rev 
splice donor (SD) inserted immediately upstream of gp!60 sequences at 
the Pstl site within VUns (this is the solitary' PstI site within both of 
these vectors). Synthetic complementary oligomers encoding the SD 
were designed to ligate into the Pstl site retaining the original site at the 
5' -end but destroying the Pstl site at the 3'-end of the insert after 
li cation. The oligomer sequences used were: 5'-GTC ACC GTC CTC 
TAT CAA AGC AGT AAG TAG TAC ATG CA-3'. SEQ.ID:26: and 
5 -TGT ACT ACT TAC TGC TTT GAT AGA GGA CGG TGA CTG 
CA-3\ SEQ.ID:27;. The resulting plasmids were verified by restriction 
digestion mapping and by DNA sequencing across the entire SD/PstI 
region. 

A) VUns-HIVnThgpl60 : HlVmb gp!60 was cloned by PCR 
amplification from plasmid pF412 which contains the 3'-terminal half 
of the HlVmb genome derived from HlVmb clone HXB2. The PCR 
sense and antisense oligomers were 5-GGT ACA TGA TCA ACC ATG 
AGA GTG AAG GAG AAA TAT CAG C-3\ SEQ. ID:28:, and 5'-CCA 
CAT TGA TCA GAT ATC CCC ATC TTA TAG CAA AAT CCT TTC 
C-3', SEQ. ID:29:, respectively. The Kozak sequence and translation 
stop codon are underlined. These oligomers provide Bell restriction 
enzyme sites outside of the translation open reading frame at both ends 
of the enr gene. (Bell-digested sites are compatible for ligation with 
Bglll-digested sites with subsequent loss of sensitivity to both restriction 
enzymes. Bell was chosen for PCR-cloning gpl60 because this gene 
contains internal BglD and as well as BamHI sites). The antisense 
oligomer also inserts an EcoRV site just prior to the Bell site as 
described above for other PCR-derived genes. The amplified gpl60' 
gene was agarose gel-purified, digested with Bell, and ligated to VUns 
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which had been digested with Bglll and treated with calf intestinal 
alkaline phosphatase. The cloned gene was about 2.6 kb in size and each 
junction of gp 1 60 with VUns was confirmed by DNA sequencing. 

B) . VUns-HIVm h zp\60f\RES/REV : pGEM-3-IRES/KEV was 
digested with HinDII and Smal restriction enzymes (contained within 
the pGEM-3 multi-linker region) to remove the entire IRES/REV 
sequence (-0.9 kb) and then ligated with VUns-HIVrjIbgpl 60 which 
had been digested with EcoRV and phosphatased. This procedure 
yielded an 8.3 kb dicistronic VUns containing gpl60 followed by IRES 
and REV which directs expression of both of these HIV gene products. 
AH of the junction regions were verified by DNA sequencing. 

C) VlJns-tPA-HIVT TTR gp!60 : This vector is similar to Example 2(C) 
above, except that the full-length gpl60, without the native leader 
sequence, was obtained by PCR. The sense oligomer was the same as 
used in I.C. and the antisense oligomer was 5'-CCA CAT TGA TCA 
GAT ATC CCC ATC TTA TAG CAA AAT CCT TTC C-3\ 
SEQ.ID:30;. These oligomers provide Bell sites at either end of the 
insert as well as an EcoRV just upstream of the Bell site at the 3'-end. 
The 5-terminal Bell site allows ligation into the Bglll site of ViJns-tPA 
to create a chimeric tPA-gp!60 gene encoding the tPA leader sequence 
and gpl60 without its native leader sequence. Ligation products were 
verified by restriction digestion and DNA sequencing. 

D) V 1 ]ns-tat/rev/envY[lB ' This expression vector is patterned after one 
described by D. Rekosh et ah [Proc. Natl. Acad. Sci. USA, 85, 334 
(1988)] employing a "genomic" segment of an HIV-1 ETIB clone (HXB2) 
encompassing unspliced tat, rev, and env in their entirety. VUns was 
digested with Bglll followed by T4 DNA polymerase blunting and calf 
intestinal alkaline phosphatase treatment. A Sall/Xhol fragment of the 
IIIB genome contained within pF412 was obtained by restriction 
digestion and blunted with T4 DNA polymerase. Ligation products 
were verified by restriction digestion mapping and DNA sequencing. 
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E) VI Jns-;rv/f/7vIIIB: This vector is a variation of the one described in 
section D above except that the entire tat coding region in exon ] is 

^ deleted up to the beginning of the REV open reading frame. VUns- 
gpI60lIIB (see section A. above) was digested with Psti and Kpnl 
restriction enzymes to remove the 5 -region of the gpl60 gene. PCR 
amplification was used to obtain a DNA segment encoding the first/?£V 
exon up to the Kpnl site in gp!60 from the HXB2 genomic clone. The 
sense and antisense PCR oligomers were 5-GGT ACA CTG CAG TCA 
CCG TCC T ATG GCA GGA AGA AGC GGA GAC-3\ SEQ.ID:3 1 : 
and 5 -CCA CAT CA GGT ACC CCA TAA TAG ACT GTG ACC-3, 
SEQ-ID:32: respectively. These oligomers provide Psti and Kpnl 
restriction enzyme sites at the 5 - and 3'- termini of the DNA fragment, 

i5 respectively. The resulting DNA was digested with Psti and Kpnl, 
purified from an agarose electrophoretic gel, and ligated with VUns- 
gpl60(PstI/KpnI). The resulting plasmid was verified by restriction 
enzyme digestion. 

20 II. IN VITRO EXPRESSION OF gnl60 VACrTMF.- 

RD and 293 cells were transiently transfected with gpl60 
and REV expression constructs. A Western blot analysis shown in Fig. 
5 using an anti-gp41 monoclonal antibody (Chessie 8, NIH AIDS 
Research and Reference Program #526) showed that gpl60 expression 

25 by VUns-gpl60 (SD) required the addition of VUns-REV (this vector 
produces > 1 fig REV/10 6 cells in transient transfections). VUns- 
gpl60/IRES/REV efficiently expressed gpl60 without additional REV 
added in trans, confirming function of the dicistron. Similar results 
were found with an anti-gpl20 monoclonal antibody (1C1, Repligen, 

3o #RP10I0-10) for immunoblot visualization. Proteolytic processing of 
gp!60 to the mature gp!20 and gp41 forms was observed for each 
vector. Addition of REV in trans to the dicistronic gpl60/REV vector 
did not result in more gpI60 expression indicating that REV expression 
is not limiting for gpl60 expression in this vector. Expression of gpl60 
improved if the tat/REV SD was included within dicistronic gp!60/REV 
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construct indicating the importance of this site for optimal REV- 
dependent gp!60 expression. We were also surprised to discover that 
dicistronic gpl60/REV expressed more than ten-fold more gp!60 than 
the genomic tat/REV/env construct for transient transfections, again 
demonstrating the high efficiency of this vector for gp 1 60 expression. 

These vectors provide nucleic acid constructs for gpl60 
plasmid vaccinations with gpl60 and REV genes either on separate 
plasmids or on the same piasmid. In the case of the tpa-gpl60 
construct, REV need not be provided in cis or in trans to achieve 
efficient gp!60 expression, therefore allowing other genes to be 
incorporated in a dicistronic construct. 

For the REV-dependent constructs, it is important to test 
whether effective gpI60 expression following vaccination requires REV 
to be present on the same plasmid because very small quantities of DNA 
are taken up by muscle cells following intramuscular injection, and 
individual muscle cells (each having hundreds of nuclei) may not 
receive copies of different plasmids in proximal locations within the 
cell. 

III. IN VIVO VACCINATION WITH gp!60 VACCINES: 

Three different vector strategies were compared for their 
abilities to induce anti-gpl20 antibody responses in nonhuman primates 
using PNVs encoding gpl60: vaccination with (1) dicistronic 
gpl60/REV using VUns-gpl60niB/IRES/REV (SD); (2) the genomic 
gpl60 construct VlJns/tat/rev/envniB; and (3) a mixture of 
monocistronic vectors, VI Jns-gpl60lIIB (SD) and VUns-REV. 
Vaccination doses of 2 mg/animal were used for up to three vaccination 
rounds which were delivered at one month intervals while 
simultaneously obtaining bleeds. Anti-gpl20 ELISA titers using 
recombinant gpl20inB are shown for monkeys vaccinated with each of 
these vectors. Dicistronic gpl60/REV elicited antibody responses in 
both rhesus and African Green monkeys while the genomic gp!60 and 
mixed monocistronic vectors did not elicit detectable antibodies after 
two rounds of vaccination (i.e., one month following the second 
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vaccination). All four monkeys which received dicistronic gpI60/REV 
also showed specific anti-gp41 reactivities as measured bv the BIAcore 
assay using recombinant gp41 (ABT) as the immobilized substrate (data 
not shown). The sera obtained from these monkeys also showed anti- 
V3IHB ELISA reactivities with titers ranging from -50 - 100. These 
results prove that in vivo expression induced by PNV for multiple 
cistrons is not analogous to results obtained by in vitro transfection 
methods in which gp 1 60 expression was shown for all three vector 
strategies. Note especially that in vitro transfection resulted in 
equivalent expression by the mixed monocistronic gpl60 and REV 
vectors as compared to dicistronic gp!60/REV {see Fig. 5). These 
experiments prove that our dicistronic PNVs do deliver effective 
coordinate expression following in vivo vaccination while other 
methods of vaccination with multiple cistrons were unable to do so. 
See figure 9, showing two African Green Monkeys and two rhesus 
monkeys and one rabbit's immune responses. 
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Anti-QD^20 ELISA Titers Elicited hv qd16Q PIWr 
in Non-Human Primates 
2 mg DNA per round 

Titer 



VectorvSoecies post 2 nd post 3 id 

VUns-tat/rev/env nm: 

African Green (#1) <20 ND 1 

(#2) <20 \sD 

(#3) <20 |ND 

V1 Jns-aDl6Q mR + VUns-rev ?: 

African Green (#1) <20 tsD 

(#2) <20 hO 

(#3) <20 ND 

V1 Jns-QDlBnnip/IRES/rev : 

African Green (#1 ) 85 90 

(#2) 75 60 

Rhesus (#1) 165 175 

(#2) 290 260 



1 ND = not determined. 

2 This PNV represents an equimolar mixture of the two monocistronic 
vectors. 
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Anti-V3 ^ ELISA Titers Elicited bv Qp160/Veu Dicistron ' 
in Non-Human Primates. 2 ma DNA per round 

Species (animal #) Titer (post 3 1^1 vaccination) 

African Green (#1 ) 70 
(#2) 45 

Rhesus (#1) 55 
(#2) 100 

'Using V1Jns-gp1 60 M iB/IRES/rei/ as the inoculation vector. 



EXAMPLE 4 

srv VACCINES 

An SIV env construct, VUn-SIV gpl52, was made by 
PCR-cloning from a genomic clone of the SIVMAC251 virus isolate and 
confirmed by DNA sequencing of both junctions with the vector. This 
strain is homologous to the virus which is used at the New England 
Regional Primate Center (NRPC) for infectious SIV challenges to 
rhesus monkeys. A similar SIV gpl52 construct is prepared in which 
the DNA encoding the leader peptide region uses alternative codons but 
which retains the native amino acid sequence. This reduces the REV- 
dependence of this construct and makes a more stable mRNA transcript. 
These vaccine constructs were prepared as follows. 

I. SIV VACCINE CONSTRUCTS: 

A). V1J-SIVMAC251 p2S ^ : The central peptide of 
SIV f>a!> , referred to as p28 f>af> , was chosen for a polynucleotide 
vaccine to test for CTL generation in nonhuman primates. This region 
of flew encodes a known CTL epitope for macaque monkeys which 
have the MHC Class I haplotype known as Mamu-AOl. Thus, monkeys 
bearing this haplotype should demonstrate CTL reactivity this gat* 
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epitope after vaccination with the appropriate gag plasmid. While both 
SIV and HIV gag genes contain regulator> r sequences which are REV 
dependent. p28 gag expression appears to be less REV -dependent so 
that at least some expression may be achieved in the absence of REV . 

SIV p28 gag was cloned into expression vectors VI using 
Bglll restriction enzyme sites after PCR amplification from the plasmid 
p239SpSp5' (obtained from the NIH AIDS Research and Reference 
Program, catalogue #829) using custom synthetic 
oiigodeoxyribonucleotides. This plasmid encodes the 5'- half of the 
SIVMAC239 genome. SIVjvlAC 239 is a subsequent in vitro passage 
line of SIVMAC25 1 which has undergone some mutations compared to 
the parental virus. However, the amino acid sequences between these 
viruses are identical for p28 gag. The PCR sense and antisense 
oligomers were 5 -GGT ACA AGA TCT ACC ATG GGA CCA GTA 
15 CAA CAA ATA GGT GGT AAC-3' , SEQ. ID:33:, and 5'-CCA CAT 
AGA TCT TTA CAT TAA TCT AGC CTT CTG TCC C-3', SEQ. 
ID:34:. These oligos provide Bglll restriction enzyme sites outside the 
translational open frames, a consensus Kozak translation initiation codon 
context (underlined) and translation stop codon (underlined). PCR- 

20 

generated p28 gag was agarose gel-purified, digested with Bglll and 
ligated into Bglll-treated, phosphatased VI. This gene was subsequently 
subcloned into our optimized expression vector, VI J, using Bglll 
restriction enzyme sites and designated as V1J-SIV p28 gag. The cloned 
gene was about 0.7 kb long. The junction sites of the VI J CMV 
promoter and 5'terminus of p28 gag were verified by DNA sequence 
analysis for each construct. In vitro expression of SIV p28 protein was 
compared for VI J and VI constructs by Western blotting using plasma 
from an SIV-infected macaque monkey to detect gag protein. The VI J- 
SIV p28 gag construct consistently gave the most product at the 
appropriate molecular weight position. Similar and even improved 
results are obtained with the more optimized Vljneo, VUns and V1R 
vectors. 
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B}. V1J-SIVMAC25] ncf : SIV ncf was cloned after PCR 
amplification from the plasmid pBK28 which encodes the entire 
SIV\tAC251 genome (a gift from Dr. Vanessa Hirsch. MAID, NIH. 
Rockvilie, MD; now listed as catalogue #133, NIH AIDS Research and 
Reference Program). The PCR sense and antisense oligomers were 5'- 
GGT ACA ACC ATG GGT GGA GCT ATT TCC ATG AGG-3', SEQ. 
ID:35: and S'-CCT AGG TTA GCC TTC TTC TAA CCT CTT CC-3\ 
SEQ- ID:36:. The Kozak site and translation stop codon are underlined. 
The amplified ncf gene was agarose gel-purified, blunt-ended using T4 
DNA polymerase, phosphoryiated at the 5*-terminus using T4 DNA 
kinase, and cloned into a blunted Bglll restriction enzyme site of VI J 
which had been phosphatased using calf intestinal alkaline phosphatase. 
The cloned gene was about 0.76 kb long. The junction site of the VI J 
CMV promoter and 5'-terminus of ncf was confirmed by DNA 
sequencing. In vitro expression was shown using Western blot analysis 
and an HIV ncf antiserum (catalogue #331, NIH AIDS Research and 
Reference Program). 

C). VUn-SIVMAC251 gpI52: SIV env , referred to as 
gpl52, was cloned after PCR amplification from the plasmid pBK2S 
into VUneo (see Nucleic Acid Pharmaceuticals patent). The PCR 
sense and antisense oligomers were 5 -GGT ACA AGA TCT ACC ATG 
GGA TGT CTT GGG AAT CAG C-3\ SEQ. ID:37: and 5 -CCA CAT 
5 AGA TCT GAT ATC GTA TGA GTC TAC TGG AAA TAA GAG G- 
3'. SEQ.ID:38. The Kozak site and amber translation stop codon are 
underlined. The PCR product has Bglll restriction enzyme sites outside 
the translation open reading frame at both ends with an additional 
EcoRV site immediately preceding the 3-terminaI Bglll site but after 
Q the amber stop codon. This provides a convenient restriction enzyme 
site for subsequent cloning steps. The amplified gpl52 gene was 
agarose gel-purified, Bglll-digested and ligated with VUn which had 
.been Bglll-digested and phosphatased. The cloned gene was about 2.2 
kb long. The junctions at each end of gpI52 with VUn CMV promoter 
and BGH terminator regions were verified by DNA sequencing. 
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II. IN VIVO VACCINATION WITH SIV VACCINES: 

Two SIV gene constructs have been used for vaccination of 
rhesus monkeys and have been shown to generate specific CTL 
responses in non-human primates (see figure 4). 

V1J-SIV p2R gag, which expresses the relatively REV- 
mdependent central peptide of gag, and V1J-SIV nef were i.m. -injected 
into Macaca mulatta monkeys at 3mg/vaccination for three injection 
rounds spaced one month apart. The ^'-specific CTL response of 
rhesus monkeys with the Mamu-AOl MHC I haplotype is restricted 
primarily to a single peptide epitope within p28 gag. Mamu-AOl + 
monkeys receiving V1J-SIV p28 gag had ^-specific CTL activity 
beginning at one month after the second injection while Mamu-AOl _ 
monkeys receiving this DNA as well as monkeys receiving VI J control 
DNA did not show a CTL response. Both in vitro, gag peptide- 
restimulated CTL as well as primary CTL were detected after the 
second and third vaccination rounds, respectively. These CTL activity 
levels were comparable to those generated by vaccinia-gag inoculation. 
Subsequently, the CTL levels declined in responding animals. These 
animals are re-vaccinated to boost the initial CTL response. V1J-SIV 
^/"-vaccinated animals have not shown a specific CTL response, 
although a more refined assay, such as the one used for gag CTL 
detection, (i.e., no dominant MHC I haplotypeAre/ peptide relationship 
has been defined for rhesus monkeys so that peptides of unknown 
effectiveness are used for stimulation, and there is no positive control), 
may provide a different result. 

EXAMPLE 5 
OTHER VACCINE CONSTRUCTS 

A. V 1 Jns-HIVTTTR gqg//w/-RRE/IRES/RFV : Dicistronic 
expression vectors encoding gag with or without the protease region of 
pol were made by PCR amplification of HTVlUB gug-pol sequences with 
several variations. Inclusion of the protease iprt) segment of pol 
allows proteolytic processing of gag into various peptides (e.g., pi 7, 
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p24. pi 5. etc.) which comprise the mature capsid particles while 
omission of this enzyme results in p55 synthesis in the form of 
immature capsid particles. More extensive sequences of pol were not 
included to avoid potential safety hazards that may be associated with 
the reverse transcriptase and integrase enzymatic activities of poL For 
capsid panicles, whether processed into the mature forms or not. to 
be extruded from cells myristoylation of the glycine amino acid at 
position two following the initial Met must occur. Mutagenesis of this 
glycine residue abrogates myristoylation and no gag particles are 
extruded from the cell. These modifications of gag allow us to 
determine whether either generation of anti-#a# CTLs following 
vaccination with such f>ai> vectors is affected by proteolytic processing 
and/or extrusion of capsid from cells. Some of the vectors listed below 
contain a splice donor (SD) site that is found upstream of the gag open 
reading frame. These vectors allow us to determine whether this SD is 
necessary for optimum rev-dependent expression of gag as was 
inclusion of the tat/rev SD for optimum gpl60 expression. 

1) VUns-HTVTTTft ea v-nrt/R R K/IR RS /R F. V • Agag-prt 
encoding DNA segment was obtained by PCR amplification using the 
following sense and antisense oligomers: 5-GGT ACA GGA TCC ACC 
ATG GGT GCG AGA GCG TCA GTA TTA AGC-3*, SEQTD:39: and 
5-CCA CAT GGA TCC GC CCG GGC TTA CAT CTC TGT ACA 
AAT TTC TAC TAA TGC-3\ SEQ.ID:40:, respectively. These 
oligomers provide BamHI restriction enzyme sites at either end of the 
segment, a Kozak initiation of translation sequence including an Ncol 
site, and an Srfl site immediately upstream of the BamHI site at the 3 - 
terminus. The Srfl site was used to clone the RRE/IRES/REV cassette 
from pGEM-3-RRE/IRES/REV, which was excised using the EcoRV 
restriction enzyme, immediately downstream of gag-prt. All ligation 
junctions were DNA sequence verified and the construct was further 
verified by restriction enzyme mapping. 

2) VUns-HIVum g - p ?7/R R E/I R E S /R F V f S D V This 
vector was prepared exactly as vector 1 above except that the PCR sense 
oligomer used was 5 -GGT ACA GGA TCC CCG CAC GGC AAG 
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AGG CGA GGG-3 SEQ.ID:4I:. This allows inclusion of the upstream 
SD site at the beginning of the xax sequence. This construct was 
verified by restriction enzyme mapping and DNA sequencing of the 
ligation junctions. 

3 ) VlJns-HIV rTTR gtfg-prr/RRE/IRES/REV(w/o 
myristoylation) : This vector is prepared exactly as vector 1 above 
except that the PCR sense oligomer used was 5'-GGT ACA GGA TCC 
ACC ATG GCT GCG AGA GCG TCA GTA TTA AGC-3\ SEQ.ID:42. 

4) VUns-HIVr rm ggg/RRE/IRES/RFV : This vector is 
prepared exactly as vector 1 above except that the PCR antisense 
oligomer used was 5'-CCA CAT GGA TCC GCC CGG GCC TTT ATT 
GTG ACG AGG GGT CGT TGC-3\ SEQ.ID:43. 

5i VlJns-HIVT Tm i^'/RRE/IRES/REV fSDV This vector 
is prepared exactly as vector 4 above except that the PCR sense 
oligomer used was 5'-GGT ACA GGA TCC CCG CAC GGC AAG 
AGG CGA GGG-3\ SEQ.ID:44. 

6) VIJns-HrVTrm ggg/RRE/IRES/REV fw/o 
myristoylation) : This vector is prepared exactly like vector 5 except 
that the PCR sense oligomer used was 5 -GGT ACA GGA TCC ACC 
ATG GCT GCG AGA GCG TCA GTA TTA AGC-3', SEQ.ID:45. 

B. VUns-HIV nef \ This vector uses a ncf gene from a 
viral strain representative of those in the infected population using sense 
and antisense PCR oligomers analogous to those used for SIV nef. 

C. pGEM-3-X-IRES-B7 : (where X = any antigenic gene) 
As an example of a dicistronic vaccine construct which provides 
coordinate expression of a gene encoding an immunogen and a gene 
encoding an immunostimulatory protein, the murine B7 gene was PCR 
amplified from the B lymphoma cell line CHI (obtained from the 
ATCC). B7 is a member of a family of proteins which provide essential 
costimulation T cell activation by antigen in the context of major 
histocompatibility complexes I and IL CHI cells provide a good source 
of B7 mRNA because they have the phenotype of being constitutive!}' 
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activated and B7 is expressed primarily by activated antigen presenting 
cells such as B cells and macrophages. These cells were further 
stimulated in vitro using cAMP or IL-4 and mRNA prepared using 
standard guanidinium thiocyanate procedures. cDNA synthesis was 
performed using this mRNA using the GeneAmp RNA PCR kit (Perkin 
-Elmer Cetus) and a priming oligomer (5-GTA CCT CAT GAG CCA 
CAT AAT ACC ATG-3'. SEQ.ID:46:) specific for B7 located 
downstream of the B7 translational open reading frame. B7 was 
amplified by PCR using the following sense and antisense PCR 
oligomers: 5 -GGT ACA AGA TCT ACC ATG GCT TGC AAT TGT 
CAG TTG ATG C-3\ SEQ.ID:47:, and 5-CCA CAT AGA TCT CCA 
TGG GAA CTA AAG GAA GAC GGT CTG TTC-3', 
SEQ.ID:48:, respectively. These oligomers provide Bglll restriction 
enzyme sites at the ends of the insert as well as a Kozak translation 
initiation sequence containing an Ncol restriction site and an additional 
Ncol site located immediately prior to the 3-terminal Bglll site. Ncol 
digestion yielded a fragment suitable for cloning into pGEM-3-IRES 
which had been digested with Ncol. The resulting vector, pGEM-3- 
IRES-B7, contains an IRES-B7 cassette which can easily be transferred 
to VUns-X, where X represents an antigen-encoding gene. 

D. pGEM-3-X-IRES-GM-CSF : (where X = any antigenic 
gene) This vector contains a cassette analogous to that described in item 
C above except that the gene for the immunostimulatory cytokine, GM- 
CSF, is used rather than B7. GM-CSF is a macrophage differentiation 
and stimulation cytokine which has been shown to elicit potent anti- 
tumor T cell activities in vivo [G. Dranoff et al.. Proc. Natl. Acad. Sci. 
USA, 90, 3539 (1993). 

E. pGEM-3-X-IRES-IL-12 : (where X = any antigenic 
gene) This vector contains a cassette analogous to that described in item 
C above except that the gene for the immunostimulatory cytokine, IL- 
12, is used rather than B7. IL-I2 has been demonstrated to have an 
influential role in shifting immune responses towards cellular, T cell- 
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dominated pathways as opposed to humoral responses [L. Alfonso et al.. 
Science, 263, 235, 1994]. 

F. VUns-HIVx gpl60/IRES/rfv TTTR (SD) : This vector is 
analogous to the one described in I.B. above except that gpl60 genes 
derived from various clinical strains are used rather than gpl60 derived 
from laboratory strain II1B. 

G. VUns-PRR/34/HA-IRES-SIV p2R gag 

This construct provides an influenza hemagglutination gene 
(HA) in concert with the SIV p28 gag gene for coordinate expression 
via the IRES element. The PR8/34/HA gene was amplified by PCR 
using the following sense and antisense oligomers: 5-GGT AC A AG A 
TCT ACC ATG AAG GCA AAC CTA CTG GTC CTG-3', 
SEQ.ID:49:, and 5-CCA CAT AGA TCT GAT ATC CTA ATC TCA 
GAT GCA TAT TCT GCA CTG C-3\ SEQ.ID:50:, respectively. The 
resulting DNA segment has Bglll restriction enzyme sites at either end 
and an EcoRV site at the 3'-terminus. After Bglll digestion this gene 
was cloned into VUns which had been digested with Bglll followed by 
alkaline phosphatase treatment. SIV p28 gag was excised from V1J-SIV 
p28 gag by Ncol and BglD digestion. pGEM-IRES was digested with 
Ncol and BamHI for directional ligation with p28 #<2£/NcoI/BglII. The 
IRES-p28 gag cassette is removed by restriction digestion with Smal 
and Hindll and iigated into the EcoRV site of VI Jns-A/PRR/HA. In 
vivo coordinate expression of these genes allows generation of potent 
antibody responses by PNV vaccination (HA), with requisite T cell help, 
which provides such help in a local environment to potentiate the CTL 
response of the second gene product (SIV p28 gag). This construct also 
demonstrates the ability to use the PNV and method of this invention to 
generate immune responses against multiple antigens whether or not 
related to HIV. Those skilled in the art will appreciate that this type of 
construct could be mixed with other, bi- or tri-cistronic constructs to 
produce a multivalent combination polynucleotide vaccine. 
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H. VIJn^-tPA-gpl60 nig /IRES/SiV p28 mum : VUns-tPA- 
gpl60niB was digested with EcoRV, treated with calf intestinal alkaline 
phosphatased. and ligated with IRES-SIV p28 gag which had been 
removed from pGEM-3-IRES-SIV p28 by restriction enzyme digestion 
using Smal and Hindll. In vivo coordinate expression of these genes 
allows coupling a protein which generates strong helper T cell responses 
(gpJ60) to one which provides Class I MHC-associated CTL epitopes 
(SIV p28 gag). This vaccine is designed for immunization of rhesus 
monkeys for generation of anu-env neutralizing antibodies and CTL as 
well as anti-SIV gag CTL. These monkeys are subsequently challenged 
with appropriate SHIV viral challenges [see J. Li et al.. J. A.I.D.S. 5, 
639-646 (1992)]. 

I. VUns-tPA-gpl20 TTTR /IRES/SfV p28 vav : This vector 
is constructed exactly as VlJns-tPA-gpl60rHB/IRES/SrV p28 vai> 
except that VI Jns-tPA-gpl20niB is used in place of the gp!60 gene. 
Vaccination and SHIV challenge are conducted as described above. 

J. VlJns-tPA-gpl20T TTR /IRES/HTVgqg /IRES/m • : This 
vector is similar to those described above except that a tricistron 
provides gag and rev expression in addition to gpl20. 

K. VlJns-tPA-gpl60n m /IRES/HIV vav /IRES /rev: This 
vector is similar to those described above except that a tricistron 
provides gay and rev expression in addition to gpI60. 

EXAMPLE 6 

ASSAY FOR HIV CYTOTOXIC T-L YMPHOCYTRS : 

The methods described in this section illustrate the assay as 
used for vaccinated mice. An essentially similar assay can be used with 
primates except that autologous B cell lines must be established for use 
as target cells for each animal. This can be accomplished for humans 
using the Epstein-Barr virus and for rhesus monkey using the herpes B 
virus. 
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Peripheral blood mononuclear cells fPBMC.i are derived 
from either freshly drawn blood or spleen using Ficoll-Hypaque 
centrifugation to separate erythrocytes from white blood cells. For 
mice, lymph nodes may be used as well. Effecter CTLs may be 
prepared from the PBMC either by in vitro culture in IL-2 (20 U/ml) 
and concanavalin A (2ug/ml) for 6-12 days or by using specific antigen 
using an equal number of irradiated antigen presenting cells. Specific 
antigen can consist of either synthetic peptides (9-15 amino acids 
usually) that are known epitopes for CTL recognition for the MHC 
haplotype of the animals used, or vaccinia virus constructs engineered to 
express appropriate antigen. Target cells may be either syngenic or 
MHC haplotype-matched cell lines which have been treated to present 
appropriate antigen as described for in vitro stimulation of the CTLs. 
For Balb/c mice the PI 8 peptide 

(ArglleHisIleGlyProGlyArgAlaPheTyrThrThrLysAsn, SEQ.ID:51:, for 
HIV MN strain) can be used at 10 \iM concentration to restimulate CTL 
in vitro using irradiated syngenic splenocytes and can be used to 
sensitize target ceils during the cytotoxicity assay at 1-10 \iM by 
incubation at 37°C for about two hours prior to the assay. For these 
H-2 d MHC haplotype mice, the murine mastocytoma cell line, P815, 
provides good target cells. Antigen-sensitized target cells are loaded 
with Na 5 lCr04, which is released from the interior of the target cells 
upon killing by CTL, by incubation of targets for 1-2 hours at 37°C 
(0.2 mCi for -5 x 10^ cells) followed by several washings of the target 
cells. CTL populations are mixed with target cells at varying ratios of 
effectors to targets such as 100:1, 50:1, 25:1, etc., pelleted together, and 
incubated 4-6 hours at 37°C before harvest of the supernatants which 
are then assayed for release of radioactivity using a gamma counter. 
Cytotoxicity is calculated as a percentage of total releasable counts from 
the target cells (obtained using 0.2% Triton X-100 treatment) from 
which spontaneous release from target cells has been subtracted. 
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EXAMPLE 7 
ASSAY FOR HIV SPECIFIC ANTIBODIES: 

ELISAs were designed to detect antibodies generated 
against HIV using either specific recombinant protein or synthetic 
peptides as substrate antigens. 96 well microliter plates were coated at 
40C overnight with recombinant antigen at 2 jig/ml in PBS (phosphate 
buffered saline) solution using 50 (il/well on a rocking platform. 
Antigens consisted of either recombinant protein (gpl20, rev: Repligen 
Corp.; gp!60, gp41: American Bio-Technologies, Inc.) or synthetic 
peptide (V3 peptide corresponding to virus isolate sequences from IIIB, 
etc.: American Bio-Technologies, Inc.; gp4I epitope for monoclonal 
antibody 2F5). Plates were rinsed four times using wash buffer 
(PBS/0.05% Tween 20) followed by addition of 200^1/well of blocking 
buffer (1% Carnation milk solution in PBS/0.05% Tween-20) for 1 hr 
at room temperature with rocking. Pre-sera and immune sera were 
diluted in blocking buffer at the desired range of dilutions and 100 uJ 
added per well. Plates were incubated for I hr at room temperature 
with rocking and then washed four times with wash buffer. Secondary 
antibodies conjugated with horse radish peroxidase, (anti-rhesus Ig, 
Southern Biotechnology Associates; ami- mouse and anti-rabbit Igs, 
Jackson lmmuno Research) diluted 1:2000 in blocking buffer, were then 
added to each sample at 100 uJ/well and incubated 1 hr at room 
temperature with rocking. Plates were washed 4 times with wash buffer 
and then developed by addition of 100 (il/well of an o-phenylenediamine 
(o-PD, Calbiochem) solution at 1 mg/ml in 100 mM citrate buffer at pH 
4.5. Plates were read for absorbance at 450 nm both kinetically (first 
ten minutes of reaction) and at 10 and 30 minute endpoints (Thermo- 
max microplate reader, Molecular Devices). 

EXAMPLE 8 
ASSAY FOR HIV NEUTRALIZING ANTTROnrFS 

In vitro neutralization of HIV isolates assays using sera 
derived from vaccinated animals was performed as follows. Test sera 
and pre-immune sera were heat inactivated at 56°c for 60 min before 
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use. A titrated amount of HIV- 1 was added in 1:2 serial dilutions of test 
sera and incubated 60 min at room temperature before addition to I(P 
MT-4 human lymphoid cells in 96 well microtiter plates. The virus/cell 
mixtures were incubated for 7 days at 37°C and assayed for virus- 
mediated killing of cells by staining cultures with tetrazolium dye. 
Neutralization of virus is observed by prevention of virus-mediated cell 
death. 

EXAMPLE 9 

PROTECTION OF CHIMPANZEES UPON CHALLENGE WITH 
VIRULENT HIV-1: 

The only animal HIV challenge model to date is with 
chimpanzees. While chimpanzees do not develop HIV-related 
immunodeficiency disease they can be infected with some HTV viral 
isolates. The most common strain used to date in this model is the IETB 
strain (BH10) although challenge stocks for other isolates are being 
developed, e.g., for SF2. We envision vaccination of chimpanzees in an 
analogous manner to vaccination in other nonhuman primates using HIV 
env and na^-pol constructs derived from the HIV-1 IIIB strain (HXB2 
clone) as described within this document to achieve anti-HIV humoral 
and cellular responses. While the BH10 challenge virus for 
chimpanzees is IIIB derived as are our vaccination construct genes, 
there is heterogeneity within this virus so that HXB2 is only one of at 
least three variations of IIIB present in the viral inoculum. Thus, the 
IIIB challenge experiment of HXB2 gene vaccinated monkeys is not 
completely homologous. 

We are vaccinating chimpanzees 3-5 rounds with 
polynucleotide HIV gene vaccines with doses of 0.1-3 mg of 
plasmid/round. After characterization of vaccine-induced humoral and 
CTL anti-HIV responses these monkeys are challenged with 10 to 140 
CID50 (50% chimpanzee infectious dose) by an intravenous 
administration of HIV-1 [jib inoculum diluted 1:25 in physiologic saline 
just prior to use. Infection of chimpanzees is monitored by detection of 
HIV-1 virus specific DNA sequences using DNA derived from PBMC 
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obtained from test chimpanzees, (see Example 10 for details). Vaccine- 
mediated protection can be described as a range of responses to 
challenge virus from complete sterilizing immunity (inability to detect 
virus post infection) to significant reductions and/or delay in viremia 
induced by the challenge stock. While sterilizing immunity is clearly the 
most preferred response to vaccination, reduced or delayed viremia 
may significantly influence onset of immunodeficiency disease in human 
vaccinees. 

EXAMPLE 10 

ISOLATION OF GENES FROM CLINICAL HIV ISOLATES: 

HIV viral genes were cloned from infected PBMC's which 
had been activated by ConA treatment. The preferred method for 
obtaining the viral genes was by PCR amplification from infected 
cellular genome using specific oligomers flanking the desired genes. A 
second method for obtaining viral genes was by purification of viral 
RNA from the supernatants of infected cells and preparing cDNA from 
this material with subsequent PCR. This method was very analogous to 
that described above for cloning of the murine B7 gene except for the 
PCR oligomers used and random hexamers used to make cDNA rather 
than specific priming oligomers. 

Genomic DNA was purified from infected cell pellets by 
lysis in STE solution (10 mM NaCl, 10 mM EDTA, 10 mM Tris-HCl, 
pH 8.0) to which Proteinase K and SDS were added to 0.1 mg/ml and 
0.5% final concentrations, respectively. This mixture was incubated 
overnight at 56°C and extracted with 0.5 volumes of 
phenol:chloroform:isoamyl alcohol (25:24:1). The aqueous phase was 
then precipitated by addition of sodium acetate to 0.3 M final 
concentration and two volumes of cold ethanol. After pelleting the 
DNA from solution the DNA was resuspended in 0.1 X TE solution (IX 
TE = 10 mM Tris-HCl, pH 8.0. 1 mM EDTA). At this point SDS was 
added to 0. 1 % with 2 U of RNAse A with incubation for 30 minutes at 
37°C. This solution was extracted with phenol/chloroform/isoamyl 
alcohol and then precipitated with ethanol as before. DNA was 
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suspended in 0. 1 X TE and quantitated by measuring its ultraviolet 
absorbance at 260 nm. Samples were stored at -2vJ°C until used for 
PCR. 

5 PCR was performed using the Perkin-Elmer Cetus kit and 

procedure using the following sense and antisense oligomers for gp!60: 
5 -GA AAG AGC AGA AGA CAG TGG CAA TGA -3', SEQ.ID:52: 
and 5 -GGG CTT TGC TAA ATG GGT GGC AAG TGG CCC GGG C 
ATG TGG-3', SEQ.ID:53:, respectively. These oligomers add an Srfl 
site at the 3'-terminus of the resulting DNA fragment. PCR-derived 
segments are cloned into either the VUns or V1R vaccination vectors 
and V3 regions as well as ligation junction sites confirmed by DNA 
sequencing. 

15 EXAMPLE 1 1 

SEQUENCES ACROSS VACCINE CONSTRUCT JUNCTIONS: 
Genes were cloned according to Example 10. Ln each case, the junction 
sequences from the 5' promoter region (CMVintA) into the cloned gene 
was sequenced using the primer: 

2Q CMVinta primer 5'- CTA ACA GAC TGT TCC TTT CCA TG- 3\ 
SEQ. ID:54:, which generates the sequence of the coding sequence. 
This is contiguous with the terminator/coding sequence, the junction of 
which is also shown. This sequence was generated using the primer: 
BGH primer 5'- GGA GTG GCA CCT TCC AGG -3\ SEQ. ID:55:, 

25 which generates the sequence of the non-coding strand. In every case, 
the sequence was checked against known sequences from GENBANK 
for cloned and sequenced genes from these or other HIV isolates. The 
position at which the junction occurs is demarcated by a V, which does 
not represent any discontinuity in the sequence. The first "ATG" 

3o encountered in each sequence is the translation initiation codon for the 
respective cloned gene. Each sequence provided represents a complete, 
available, expressible DNA construct for the designated HIV gene. The 
nomenclature follows the convention: "Vector name-HIV strain-gene". 
The biological efficacy of each of these constructs is shown in the same 
manner as in the foregoing Examples: 
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SEQUENCE ACROSS THE 5 JUNCTIONS OF CMVintA AND THE 
HIV GENES AND ACROSS THE 3' JUNCTIONS OF THE HIV 
GENES AND THE BGH TERMINATOR EXPRESSION 
5 CONSTRUCTS. USING DIFFERENT HIV STRAINS AND 
PROTEINS: 

1 VlIns-rgv uiB: 

SEQ-ID;56: 

10 

5' GGA GAC AGC GACGAA GAC CTC CTC AAG GCA GTC AGA CTC ATC AAG-3 
rev .... 

(Sequence begins at the 5'- terminus within the PCR oligomer. See #7 below 
for complete rev 5'- terminus sequence) 

15 SEQ.ID:57: 

5-GAT GGC TGG CA A CTA G A A GGC ACA GCA GAT CT/ GAT ATC GCA CTA 
BGH rev,., 

TTC TTT AGC TCC TG A CTC CAA TAT TGT-3' 

2 Vlin^fiiUMlUB: 



SEQ.ID:5R: 

2 5' CTT AGA TO A ACC ATG AGA GTG AAG GA GA A ATA TCA GCA CTT GTG 
CMVinta gpl60 

GAG ATG GGG GTG GAG ATG GGG CAC CAT GCT CCT TGG GAT GTT GAT GAT 
CTG TAG TGC TAC AGA AAA ATT GTG GGT-3' 

30 

SEQ-ID:59: 

5'-CTG GCA ACT AGA AGG CAC AGC AGA TO A GAT ACT GTC CCC ATC TTA 
BGH gp!60 



TAG CAA AAT CCT TTC CAA GCC CTG TCT TAT TCT-3 
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3. pGEM-3-IRES: [sequenced using SP6 (5 -GAT TTA GGT GAC ACT 
ATA G-3\ SEOTD:60:) and T7 (5'-TAA TAC GAC TCA CTA TAG GG-3'. 
5 SEQ.ID:61:) primers, Promega Biotech] 

SEQTD:62: 

5'-CAT GCC TGC AGG TCG ACT CTA/ A AT TCC G... 
pGEM-3 (SP6) IRES 

10 

SEQ.ID:63: 

5 -A CCC GGG GAT CCT CT/ A GCG CGC TTG TCT CTT GTT CCA.. 
pGEM-3 (T7) IRES 

15 4 * pGEM-3-lRES/rgv niB : [sequenced using T7 sequencing primer 

(Promega) for rev 3'-end, and] IRES 3'- oligomer (5 -GG GAC GTG GTT 
TTC C-3\ SEQ.ID:64:) for IRES/rcv junction] 

SEQ.ID:65: 

20 

5-TAT GGC CAC AAC O AT GGC AGG AAG AAG CGG AG A CAG CGA CGA AGA 

IRES rev 
CCT CCT CAA GGC AGT CAG ACT -3' 

SEQTD:66: 

25 5 -CTC GAG CCA TGG GCC CCT/ AGA CTA TAG CGT GAT AAG AAA TCG AGG 
pGEM-3 rev 

ACT GAG GTT ATA AC A TCC TCT AAG GTG GTT ATA AAC TCC CGA AGG-3 

30 5. pGEM-3-RRE/IRES/rgvmfi: [using SP6 sequencing oligomer 
(Promega) and IRES 5'- oligomer, 5*-G CTT CGG CCA GTA ACG-3\ 
SEQ.ID:67:] 

SEQ.IE>:68: 

5' TTG CAT GCC TGC AGG T/ GGT ACA TGA TCA GAT ATC G CCC GGG / C 
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pGEM-3 RRE 

CGA GAT CTT CAG ACT TGG AGG AGG AGA TAT GAG GGA CAA TTG GAG- 3 
IRES-5' 

SEQ.ID:69: 

5 -GGG GCG G AA TT/ T AG A GTC A/ ATT GAT CAG CTT GTG TAA TTG TTA 

RRE-3' 

ATT TCT CTG TCC CAC TCC ATC CAG GTC GTG TGA TTC...-3* 

6. VUi\$-{tatlrev SP): [used for VUns-gp 160lHB/IRES//f vine (SD) and 

VI Jns-gpl60niB(SD); sequenced using an oligomer complementary to 
gpl60 reading towards 5 -end of gpl60 and into CMVintA: 5-CCA 
TCT CCA CAA GTG CTG-3\ SEQ.ID:70:] 

SEQ.ID:71: 

5 -AGA TCT A AGG ACG GTG ACT GCA / TGT ACT ACT TAC TGC TTT GAT 
CMVintA tat/rev SD 

AGA GGA CGG TGA / CTG CAG AAA AGA CCC ATG G AA A-3' 

CMVintA 

7. Vl.!ns-gp160inp/IRES/r g viqF (SDV [gpl60/IRES junction 
sequenced using IRES 5'- oligomer, 5'-G CTT CGG CCA GTA ACG-3', 
SEQ.ID:72:] 

SEQ.ID:73: 

5 -GGC ACA GCA GAT C/ AG ATG GGG ATC TGA TA TCG CAC TAT TCT TTA 
BGH rev 

GCT CCT GAC TCC TGA CTC-3 



SEQUD:74: 
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5'-GGA ATT/ TGA GTC ATC / CCC ATC TTA TAG CAA A AT CCT TTC CAA -3 

IRES gp i60 
«. VUns-gflf-prry ^ (SD) : 
SEQ.ID:75: 

5 -CTT AGA TC/ C CCG CAC GGC AAG AGG CGA GGG GCG GCG ACT GGT-3 
CMVintA gag (SD) 

SEQ.ID:76: 

5-GGC ACA GCA GAT CI CGC CCG GGC TTA CAT CTC TGT ACA A AT TTC TAC 
BGH prt 

TAA TGC TTT TAT TTT TCT TCT GTC... -3' 

9 Vl.lns-gflg-i?rt iiiB : 



20 



SEQ.ID:77: 

5 ' CTT AGA TC/ CAC CAT GGG TGC GAG AGC GTC ACT ATT AA GCG GGG 
CMVintA gag 



GGA GAA TTA GAT CGA TGG GAA AAA ATT...-3 



SEQ.ID:7X: 

5-GGC ACA GCA GAT CI CGC CCG GGC TTA CAT CTC TGT ACA AAT TTC TAC 
BGH art 



TAA TGC TTT TAT TTT TCT TCT GTC . 3 



30 

10. VU ns - tP A: 



SEQ.ID:79: 

5 -TCA CCG TCC TTA GAT O ACC ATG GAT GCA ATG AAG AGA GGG CTC TGC 
CMVintA tPA leader 
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TGT GTG CTG CTC. CTG TGT GGA GCA GTC TTC GTT TCG CCC AGC G A/ G ATC 
BGH 

5 

TGC TGT GCC TTC TAG TTG CCA GCC-3' 

11. vi.ln.HPA-gpUQMN- 

10 SEQ.ID:80: 

5 -TTC GTT TCG CCC AGC G A/ TCA CAG AAA AAT TGT GGG TCA CAG TC-3' 
tPA gpl20MN 

SEQ.ID:81: 

15 

5-GGC ACA GCA GAT C/ CAC GTG TTA GCG CTT TTC TCT CTC CAC CAC-3' 
BGH gpl20MN 

12. V1.I-SIVMAC25LP2S tn 

SEQ.ID:82: 

5 -TCA CCG TCC TTA GAT CT/ ACC ATG GGA CCA GTA CAA CAA ATA GGT 

CMVintA p28 gag... 

GGT AAC TAT GTC CAC CTG CCA TTA AGC CCG AG A ACA-3' 

SEQ.ID:83: 

5 -GGC ACA GCA GAT CT/ TTA CAT TAA TCT AGC CTT CTG TCC CGG TCC-3 
BGH p28 gag 

30 13. Vl,l-SIVMA£l£li^ 

SEQ.ID:84: 

5 -TCA CCG TCC TTA GAT C/ GGT ACA ACC ATG GGT GGA GCT ATT TCC ATG 
CMVintA nef 



20 



25 
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AGG CAA TCC AAG CCG GCT GGA GAT CTG ACA GAA A-.V 
SEQ.ID:X5: 

5'-GGC ACA GCA GAT CA/ C CTA GGT TAG CCT TCT TCT AAC CTC TTC CTC 

BGH nef.... 
TGA CAG GCC TGA CTT GCT TCC AAC TCT TCT GGG TAT CTA G-.T 

14. X\]T\s-tat!revlenv . 

SEQ.ID:86: 

5-ACC GTC CTT AGA T/ TC GAC ATA GCA GAA TAG GCG TTA CTC GAC AGA 
CMVintA tatlrevienv 

GGA GAG CAA GAA ATG GAG CCA GTA GAT CCT AGA CTA GAG CCC TGG-T 

SEQ.ID.87: 

5 -GGC ACA GCA GAT C/ C GAG ATG CTG CTC CCA CCC CAT CTG CTG-3' 
BGH tatlrevienv 

EXAMPLE 12 
T CELL PROLIFERATION ASSAYS : 

PBMCs can be obtained as described in Example 6 from 
above and tested for recall responses to specific antigen as determined 
by proliferation within the PBMC population. Proliferation is 
monitored using ^H-thymidine which is added to the cell cultures for 
the last 18-24 hours of incubation before harvest. Cell harvesters retain 
isotope-containing DNA on filters if proliferation has occurred while 
quiescent cells do not incorporate the isotope which is not retained on 
the filter in free form. For either rodent or primate species 4 X 
cells are plated in 96 well microtiter plates in a total of 200 ^il of 
complete media (RPMI/10% fetal calf serum). Background 
proliferation responses are determined using PBMCs and media alone 
while nonspecific responses are generated by using lectins such as 
phytohaemagglutin (PHA) or concanavalin A (ConA) at 1- 5 jig/ml 
concentrations to serve as a positive control. Specific antigen consists of 
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either known peptide epitopes, purified protein, or inactivated virus. 
Antigen concentrations range from 1-10 [iM for peptides and 1-10 
fig/ml for protein. Lectin-induced proliferation peaks at 3-5 days of 
cell culture incubation while antigen-specific responses peak at 5-7 days. 
Specific proliferation occurs when radiation counts are obtained which 
are at least three -fold over the media background and is often given as a 
ratio to background, or Stimulation Index (SI). HIV gpl60 is known to 
contain several peptides known to cause T cell proliferation of 
gpI60/gpl20 immunized or HIV-infected individuals. The most 
commonly used of these are: Tl 

(LysGlnllelleAsnMetTrpGlnGluValGlyLysAlaMetTyrAla, 
SEQ.ID:88:); T2 (HisGluAspIielleSerLeuTrpAspGlnSerLeuLys, 
SEQ.ID:89:); and, TH4 

(AspArgVallleGluValValGlnGlyAalTyrArgAlalleArg, SEQ.ID:90:). 
These peptides have been demonstrated to stimulate proliferation of 
PBMC from antigen-sensitized mice, nonhuman primates, and humans. 

REFERENCES : 

L. Arthur et a!., J. Virol. 63, 5046 (1989). [chimp/HIV challenge 

model/virus neut. assay] 
T. Maniatis et al., Molec. cloning: a lab. manual, p. 280 Cold Spring 
Harbor 

Lab., CSH, NY (1982) [genomic DNA purif.] 
E. Emini et al.. J. Virol. 64, 3674 (1990) [chimp challenge, neut assay] 

EXAMPLE 13 

Vector V 1 R Preparation 

In an effort to continue to optimize our basic vaccination 
vector, we prepared a derivative of VUns which was designated as 
V1R. The purpose for this vector construction was to obtain a 
minimum-sized vaccine vector, i.e., without unnecessary DNA 
sequences, which still retained the overall optimized heterologous gene 
expression characteristics and high plasmid yields that V1J and VUns 



WO 95/24485 



PCT7US95/02633 



afford. We determined from the literature as well as by experiment 
that (1) regions within the pUC backbone comprising the E. coli origin 
of replication could be removed without affecting plasmid yield from 
bacteria; (2) the 3-region of the kan x gene following the kanamycin 
open reading frame could be removed if a bacterial terminator was 
inserted in its stead; and, (3) -300 bp from the 3 - half of the BGH 
terminator could be removed without affecting its regulatory function 
(following the original Kpnl restriction enzyme site within the BGH 
element). 

V ] R was constructed by using PCR to synthesize three 
segments of DNA from VUns representing the CMVintA 
promoter/BGH terminator, origin of replication, and kanamycin 
resistance elements, respectively. Restriction enzymes unique for each 
segment were added to each segment end using the PCR oligomers: 
Sspl and Xhol for CMVintA/BGH; EcoRV and BamHI for the kan r 
gene; and, BclJ and Sail for the ori r . These enzyme sites were chosen 
because they allow directional ligation of each of the PCR-derived DNA 
segments with subsequent loss of each site: EcoRV and Sspl leave blunt- 
ended DNAs which are compatible for ligation while BamHI and Bell 
leave complementary overhangs as do Sail and Xhol. After obtaining 
these segments by PCR each segment was digested with the appropriate 
restriction enzymes indicated above and then ligated together in a single 
reaction mixture containing all three DNA segments. The 5'-end of the 
ori r was designed to include the T2 rho independent terminator 
sequence that is normally found in this region so that it could provide 
termination information for the kanamycin resistance gene. The ligated 
product was confirmed by restriction enzyme digestion (>8 enzymes) as 
well as by DNA sequencing of the ligation junctions. DNA plasmid 
yields and heterologous expression using viral genes within V1R appear 
similar to VUns. The net reduction in vector size achieved was 1346 bp 
(VUns = 4.86 kb; V1R = 3.52 kb), see figure 11, SEQ.ID:45:. 

PCR oligomer sequences used to synthesize V1R (restriction enzyme 
sites are underlined and identified in brackets following sequence): 
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(1) 5*-GGT ACA AAT ATT GG CTA TTG GCC ATT GCA TAC G-3' [Sspl], 
SEQ.ID:91:. 

5 (2) 5'-CCA CAT CTC GAG GAA CCG GGT CAA TTC TTC AGC ACC-3* [Xhol], 
SEQ.ID:92: 

(for CMVintA/BGH segment) 

(3) 5'-GGT ACA GAT ATC GGA AAG CCA CGT TGT GTC TCA AAA TC- 
3'[EcoRV], SEQ.ID:93: 

(4) 5'-CCA CAT GGA TCC G TAA TGC TCT GCC AGT GTT ACA ACC-3' 
[BamHI], SEQ.!D:94: 

(for kanamycin resistance gene segment) 

1 5 (5) 5'-GGT ACA TGA TCA CGT AGA AAA GAT CAA AGG ATC TTC TTG- 
3'[Bcll], SEQ.ID:95:, 

(6) 5'-CCA CAT GTC GAC CC GTA AAA AGG CCG CGT TGC TGG-3* [Sail], 
SEQ.ID:96: 

(for E. coli origin of replication) 

20 Ligation junctions were sequenced for V1R using the following 
oligomers: 

5'-GAG CCA ATA TAA ATG TAC-3', SEQ.ID:97: [CMVintA/kan^ 
junction] 

5*-CAA TAG CAG GCA TGC-3*. SEQ.ID:98: [BGH/ori junction] 
25 5'-G CAA GCA GCA GAT TAC-3', SEQ.ID:99: [ori/kanr junction] 

EXAMPLE 14 
The HIV genes which appear to be the most important for 
PNV development are env and gag. Both env and gag require the HIV 
}0 regulatory protein, rev, for either viral or heterologous expression. 
Because efficient expression of these gene products is essential for PNV 
function, two types of vectors, rev-dependent and rev-independent, 
were tested for vaccination purposes. Unless stated otherwise, all genes 
were derived from the HIV-1 (IITB) laboratory isolate. 
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A. env : Depending upon how large a gene segment is used, varying 
efficiencies of rev-independent envy expression may be achieved by 
replacing the native leader peptide of env with the leader peptide from 
the tissue-specific plasminogen activator (tPA) gene and expressing the 
resulting chimeric gene behind the CMV promoter with the CMV 
intronA. V 1 Jns-tPA-gpl 20 is an example of a secreted gpl20 vector 
constructed in this fashion which functions to yield anti-gpl20 immune 
responses in vaccinated mice and monkeys. 

Published reports indicate that membrane-anchored 
proteins may induce a more substantial antibody responses compared to 
secreted proteins. Membrane-anchored proteins may also induce 
antibody responses to additional immune epitopes. To test this 
hypothesis, VUns-tPA-gp 160 and V 1 Jns-rev/env were prepared. The 
tPA-gpl60 vector produced detectable quantities of gpI60 and gpl20, 
without the addition of -v, as shown by immunoblot analysis of RD 
cells transfected in vitro , although expression was much lower than that 
obtained for rev/env, a rev-dependent gp 160-expressing plasmid. This 
may be due to the presence of inhibitory regions, which confer rev 
dependence upon the gp!60 transcript occur at multiple sites within 
gpl60 including at the COOH-terminus of gp41. 

Vectors containing truncated forms of tPA-gpl60, tPA- 
gpl43 and tPA-gpl50, designed to increase the overall expression of 
env by elimination of these inhibitory sequences, were prepared. The 
truncated gpl60 vectors lack intracellular gp41 regions containing 
pepride motifs (such as leu-leu) which are known to cause diversion of 
membrane proteins to the lysosomes rather than the cell surface. Thus, 
gpl43 and gpl50 may be expected to increase the transport of protein to 
the cell surface compared to full-length gpl60 where these proteins may 
be better able to elicit anti-gp!60 antibodies following DNA 
vaccination. 

A quantitative ELISA for gpl60/gpl20 expression in cell 
transfectants was developed to determine the relative expression 
capabilities for these vectors as well as for an additional vector which 
combines the features of tPA-gp!60 and rev/env (vector rev/tPA- 
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ep J 60). In vitro transfection of 293 cells followed by quantitation of 
cell-associated vs. secreted/released gpl 20 yielded the following results: 

(1) for the analogous plasmid pair, rev/env and rev/tPA- 
gp 1 60, substitution of the native leader peptide in gpl 60 with the tPA 
leader sequence did not increase the total expression of gp 1 60 or the 
amount of released gpl 20. This suggests that the leader peptide is not 
responsible for inefficient trafficking of gpl 60 to the cell surface in 
these cells. 

(2) tPA-gpl60 expresses 5-10X less gpl60 than rev/env 
with similar proportions retained intracellular^ vs. trafficked to the 
cell surface. 

(3) tPA-gp!43 gave 3-6X greater secretion of gpl 20 than 
rev/env with only low levels of cell-associated gp 143 confirming that 
the cytoplasmic tail of gpl 60 causes intracellular retention of gpI60 
which can be overcome by partial deletion of this sequence. 

(4) tPA-gpl50 gave only low levels of gpl 60 in both cells 
and media, indicating either a problem with this construct or inherent 
instability of the truncated protein. 

tPA-gpl20 derived from a primary HIV isolate (containing 
the North American consensus V3 peptide loop; macrophage tropic and 
nonsyncytia-inducing phenotypes) gave high expression/secretion of 
gpl 20 with transfected 293 cells demonstrating that it was cloned in a 
functional form. 

EXAMPLE 15 

Serological Assays 

A. Antibody Responses: 

1. gpl 20 PNVs 

An ID vs. IM vaccination experiment in mice was 
completed using VI Jns-tPA -gpl 20 (100, 10. I ng: 3X). ID vaccination 
appeared superior at the lower doses following initial rounds but all 
doses were equivalent after three rounds. 

Rhesus monkeys (RHM) and African green monkeys 
(AGM) were vaccinated with the VUns-tPA-gpl20 (MN) PNV. Peak 
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GMTs for gpl20 antibodies differed by more than five-fold between 
these two primate species: 1780 (AGM) and 310 (RHM). These results 
indicate that substantially larger antibody titers can be elicited in AGM 
compared to RHM and suggest that higher HIV neutralization titers may 
be obtained by AGM vaccination. 

2. ep!60 PNVs : VUns-rev/env vaccination flM) of mice 
did not yield antibodies to gpl60 until three injections while ID 
vaccination yielded responses after one round which remained higher 
than those produced by IM throughout the experiment (GMTs = 2115 
(ID) and 95 flM); 200 jig/mouse). This suggests that rev-dependent 
constructs can function as immunogens better by the ID route. 

RHM receiving ID or IM inoculations with VUns-rev/env 
showed peak GMTs = 790 and 140, respectively, following 4-5 
inoculations (2 mg/round). These results agree with those found for 
mice showing that this rev dependent PNV has greater efficacy for 
antibody generation by ID vaccination although the rev -independent 
construct V 1 Jns-tPA-gp 1 20 did not. RHM receiving tPA-gp 1 60 DN A 
(IM) showed lower, more variable antibody responses than those 
receiving rev/env which corroborate our determination that this vector 
expresses gpl60 4-7X less efficiently than rev/env. 
B. In Vitro Virus Neutralization 

An infectivity reduction neutralization assay (p24 gag 
production readout) using HIV(MN) as a vims source was performed 
by Quality Biologicals, Inc. (QBI). At low vims input (100 TCID50) 
complete neutralization was seen at 1/10 dilutions of sera for all three 
antisera with at least 80-90 % reduction in vims production observed in 
all samples up to 1/80 dilutions as compared to matched prebleed sera. 
However, at higher vims input (1000 TCID50), no neutralization was 
observed for any sample. 

RHM were tested for HIV (IIIB) neutralization (QBI), 
using 100 TCID50 of input vims, following vaccinations with tPA- 
gpl20 (IIIB) DNA. In two different experiments the best neutralization 
results were obtained at serum dilutions of 10 (40-99% reduction of p24 
gag ) with gag reduction observed in some samples at dilutions as high 
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as 80-fold. The most consistent samples in this assay had anti-gp 1 20 
antibody ELISA endpoint titers of at least 2000-3000. 

RHM were similarly tested for HIV fHIB) neutralization 
(QBI) following vaccinations with rev/env DNA. Overall, low levels of 
neutralization were observed: two of three RHM showed neutralization 
ranging up to 84% at a serum dilution of 10 with p24 gag reduction 
observed at subsequent dilutions of 20 or 40 while one sample did not 
show any evidence of neutralization. These samples had anti-gp 120 
antibody ELISA titers of 700-800 indicating that this is the minimum 
useful titer range for testing sera derived from gpl60 DNA vaccine 
experiments in neutralization assays. 
C. Facilitators for Enhanced Immunity 

Several experiments were initiated to test plasmid DNA 
formulations which have been reported to enhance DNA uptake 
following vaccination and increase either reporter gene expression or 
immune responses in mouse or monkey vaccinees. Hypertonic sucrose 
(up to 20-25%, w/v) DNA solutions have been reported to give more 
uniform distribution of DNA uptake, as evidenced by reporter gene 
expression, and was used in experiments in which substantial gpl60- 
specific antibodies were elicited in rodents and nonhuman primates 
vaccinated with a rev/gpl60 plasmid. The anesthetic, bupivicaine (0.25- 
0.75%, w/v), has also been reported to significantly enhance DNA 
vaccine-mediated immune responses in mice and nonhuman primates 
when used either as a pretreatment for IM injection, or as by co- 
injection with DNA in isotonic saline solution. 

Our initial results with bupivicaine showed that substantial 
mortality was caused by IM treatment with 0.5% solutions. Mortality 
varied depending on the volume of solution used and whether the mice 
were injected while under anesthetic (> 0.J mL w/o anesthetic gave 
highest mortality). Our experiments have used 0.25% solutions 
without significant mortality either as a pre-treatment or a co-treatment 
and using gpl20 or rev/env PNVs. A preliminary experiment using 
bupivicaine as a pre-treatment for three vaccination rounds did not 
show any enhancement of immune responses relative to control mice 
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while a larger experiment using both ID and IM sites as a pre-ireatment 
or co-treatment has not shown any increased antibody levels following 
one injection and appeared to decrease antibody responses m some 
groups. Three vaccinations are planned in the current study. 

This sucrose formulation experiment tested a variety of 
conditions described in the literature. Sucrose concentration was tested 
at 10, 1 5. 20. and 25% in saline or PBS solution containing 0.1 mg/mL 
of tPA-gpI20 plasmid. All samples were tested as a co-injection by IM 
or ID routes except for a 25% sucrose/PBS group that received this 
solution 15-30 minutes prior to IM DNA/PBS injection. Serum data 
derived from bleeds following the first vaccination did not show any 
enhancement of antibody responses. 

EXAMPI.F 16 

T Lymphocyte Responses : 

A- Proliferatio n and Cytokine Secretion 

T lymphocytes which have been primed in vivo with 
antigen can proliferate and secrete cytokines during in vitro cell culture 
after exogenous addition of priming antigen. Responding T cells 
usually have a MHC Class fl-restricted, CD4+ (helper) phenotype. 
Helper T cells can be functionally grouped according to the types of 
cytokines they secrete following stimulation by antigen: THl cells 
secrete primarily IL-2 and g-interferon while Th2 cells are associated 
with IL-4, IL-5, and IL-10 secretion. ThI lymphocytes and cytokines 
promote cellular immunity, including CTL and DTH responses* while 
TH2 cells and cytokines promote B cell activation for humoral 
immunity. We have previously tested for these responses in mice and 
nonhuman primates (AGM and RHM), using rgpl20niB for antigen in 
vilm, after vaccination with HIV tPA-gpI20 PNVs and shown that T 
cells from vaccinees of both species exhibit proliferative responses to 
gpl20 in vitro and that these responses are THl -like and long-lived (> 6 
months) in mice. These studies were continued with a rev PNV. 

1. mouse studies: Mice vaccinated either 3X or IX with 
200 ng VUns-rev were tested for invitro proliferation to recombinant 
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rev (r-rev) protein. Mice vaccinated 3X showed stimulation indices (SI: 
ratio of proliferation of immune cells with and without immunizing 
antigen) of 9-12 while mice receiving IX were the same as background 
(Sis = 2-3). Splenic T cells from ail rev vaccinees, but not control 
mice, secreted g-interferon in response to r-rev antigen (2.4-2.8 ng/ml. 
3X; 0.4-0.7 ng/ml, IX) while no IL-4 was detected in culture 
supematants (detection sensitivities = 47 pg/ml and 15 pg/ml for g- 
interferon and IL-4, respectively) showing these T cell responses to be 
Th 1 -like in nature as we found for gpl20 DNA vaccinees. Cytokine 
secretion may be a more sensitive assay than proliferation to specific 
antigen for determining T cell memory responses. Similar results were 
found for mice tested at least six months post vaccination. Antibodies to 
rev were not detected in any vaccinee sera as may be expected for this 
intracellular protein. 

2. Monkey Studies : Three RHM showed strong in vitro T 
cell proliferation ( Sis = 9-30) to r-rev following two vaccinations with 
VUns-rev. No anti-rev antibodies were detected in any monkeys. 
These results corroborate the above mouse/rev experiments and confirm 
that strong T cell responses can be induced by rev PNVs without 
concomitant induction of antibody responses. 

Further experiments using tPA-gp]20 DNA vaccination of 
RHM showed that (i) in vitro T cell proliferation to rgpl20 was 
obtained following one vaccination; (ii) primary responses were boosted 
following a second vaccination; and, (iii) similar proliferations were 
obtained with these vaccinees as for SHIV-infected RHM (Sis = 5-70 and 
5-35, respectively). 

B. Anti-cm 1 Cytotoxic T Lymphocytes 

Two of four RHM monkeys vaccinated with tPA-gpl20 
(IM) and gpl60/IRES/rev (ID) PNVs showed significant CTL activities 
(> 20% lysis at 10:1 E/T) against homologous target cells six weeks 
following one vaccination. Two weeks post a second vaccination all 
four monkeys showed cytotoxicities ranging from 20 -35% lysis at 20:1 
E/T. All CTL activities in this assay design were MHC Class I 
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restricted: removal of CD8+ T cells completely removed cytotoxicities 
in all four monkeys. CTL responses waned over several months and 
were boosted to > original levels with subsequent re-vaccination. These 
CTL activities were characterized as the most potent for vaccine- 
mediated responses observed in RHM. 
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SEQUENCE LISTING 



: GENERAL INFORMATION: 

.1; APPLICANT: Shiver, John W 
Liu, Margaret A 
Perry, Helen C 

Ul ! TITLE OF INVENTION : COORDINATE IN VIVO GENE EXPRESSION 

iill> NUMBER OF SEQUENCES: 10 0 

i iv) CORRESPONDENCE ADDRESS : 

(A) ADDRESSEE: Christine E. Carty 

(BJ STREET: 126 Lincoln Avenue, P.O. Box 2000 

(C) CITY: Rahway 

( D ) STATE: New Jersey 

(E) COUNTRY : United States of America 

(F) ZIP : 07065 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(CI OPERATING SYSTEM : PC-DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

!vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION : 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Carty, Christine E . 

(B) REGISTRATION NUMBER; 3 6,09 0 

<C) REFERENCE / DOCKET NUMBER: 19188Y 

(ix) TELECOMMUNICATION INFORMATION : 

(A) TELEPHONE : (908) 594-6734 

(B) TELEFAX: (908) 594-4720 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 3 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: peptide 
(in) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
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- 3" - 



FRAGMENT TYPE: internal 



ixi) SEQUENCE DESCRIPTION: SEC ID NO : 1 : 

Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser lie His He Gly Pro 
IS 10 15 

Gly Arg Ala Phe Tyr Thr Thr Gly Glu He He Gly Asp He Arg Gin 
20 25 30 

Ala His Cys 
35 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : both 

£ii) MOLECULE TYPE: peptide 

(lii) HYPOTHETICAL : NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 



(xiJ SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Cys Thr Arg Pro Asn Tyr Asn Lys Arg Lys Arg He His lie Gly Pro 
15 10 15 

Gly Arg Ala Phe Tyr Thr Thr Lys Asn He He Gly Thr He Arg Gin 

20 25 30 

Ala His Cys 
35 

2! INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: both 

(ii) MOLECULE TYPE: peptide 

i'lli) HYPOTHETICAL: NO 
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A.VTI -SENSE: NC 
FRAGMENT TYPE : internal 



(XI! SEQUENCE DESCRIPTION : SEQ ID NO : 3 : 

Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Arg He Arg He Gin Arg 
15 10 15 

Gly Pro Gly Arg Ala Phe Val Thr He Gly Lys He Gly Asn Met Arg 
20 25 30 

Gin Ala His Cys 
35 

INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL : NO 

(iv) ANT I -SENSE : NO 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Gly He His He Gly Pro 
15 10 15 

Gly Arg Ala Phe Tyr Thr Thr Gly Lys He He Gly Asn He Arg Gin 
20 25 30 

Ala His Cys 
35 

INFORMATION FOR SEQ ID NO : 5 : 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 



(Hi MOLECULE TYPE: peptide 



WO 95/24485 



PCT/XS95/02633 



ill' HYPOTHETICAL: NC 
!iv. ANTI -SENSE: NO 
fv.' FRAGMENT TYPE: internal 



(xi ; SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Cys Thr Arg Pro Ser A'sn Asn Asn Thr Arg Lys Ser lie His He Gly 
15 10 15 

Pro Gly Lys Ala Phe Tyr Ala Thr Gly Ala He He Gly Asp He Arg 
20 25 30 

Gin Ala His Cys 
35 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

{C; STRANDEDNESS : single 
(D) TOPOLOGY: both 

(ii) MOLECULE TYPE : peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Cys Thr Arg Pro Asn Asn Asn Thr Arg Arg Ser He His He Ala Pro 
15 10 15 

Gly Arg Ala Phe Tyr Ala Thr Gly Asp He He Gly Asp He Arg Gin 

20 25 30 

Ala His Cys 
35 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 
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i:> MOLECULE TYPE : cDNA 
[in. HYPOTHETI LAL : NO 

(iv) a:;t:- sense : no 

(xi : SEQUENCE DESCRIPTION: SEQ ID NO: 

CTATATAAGC AGAGCTCGTT TAG 

(21 INFORMATION FOR SEQ ID NO : 8 : 

( l ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 3 0 base pairs 
(3) TYPE : nucleic acid 
<Z) STRANDEDNESS : both 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: cDNA 

(ill) HYPOTHETICAL: NO 

(iv) ANTI -SENSE : NO 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 8 
GTAGCAAAGA TCTAAGGACG GTGACTGCAG 
(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 

GTATGTGTCT GAAAATGAGC GTGGAGATTG GGCTCGCAC 

(2) INFORMATION FOR SEQ ID NO : 1 0 : 

(i! SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 9 base pairs 
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( B 1 TYPE: nucleic acid 
[~- STRANDEDNESS: both 
{Di TCFCLOGY: linear 

111! MOLECULE TYPE: cDNA 

iin) HYPOTHETICAL: NO 

!iv) ANTI -SENSE : NO 



(Xl) SEQUENCE DESCRIPTION: SEC ID NO: 10: 
GTGCGAGCCC AATCTCCACG CTCATTTTCA GACACATAC 3 9 

(2) INFORMATION FOR SEC. ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 
[iii) HYPOTHETICAL: NO 
(IV) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GGTACAAGAT CTACTATAGG GAGACCGGAA TTCCGC 3 6 

(2) INFORMATION FOR SEQ ID NO : 12 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 4432 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : both 

(ii) MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANT I -SENSE : NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TCGCGCGTTT CGGTGATGAC GGTGAAAACC TCTGACACAT GCAGCTCCCG GAGACGGTCA 60 
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CAGCTTGTC7 


G7AAGCGGA7 


GCGGGGAGCA 


'»jAC aagcccg 


TCA3GGCGCG 


7CAGCGGG7G 




TTGGCGGGTG 


TCGGGGCTGG 


CTTAACTATG 


CGGCATCAGA 


GC AGATTGTA 


CTGAGAGTGC 


130 


ACCATATGCG 


GTGTGAAATA 


CCGCACAGAT 


GCGTAAGGAG 


AAAATACCGC 


ATCAGATTG3 


24 0 


CTATTGGCCA 


TTGCATACGT 


TGTATCCATA 


TCATAATATG 


TACAT7TATA 


TTGGCTCAT3 


j 03 


TCCAACATTA 


CCGCCATGTT 


GACATTGATT 


ATTGACTAG7 


TATTAATAGT 


AATCAATTA Z 


3 60 


GGGGTCATTA 


GTTCATAGCC 


CATATATGGA 


GTTCCGCGTT 


ACATAACTTA 


CGGTAAATGG 


420 


CCCGCCTGGC 


TGACCGCCCA 


ACGACCCCCG 


CCCATTGACG 


TCAATAATGA 


CGTATGTTCC 


4 80 


CATAGTAACG 


CCAATAGGGA 


CTTTCCATTG 


ACGTCAATGG 


GTGGAGTATT 


TACGGTAAAC 


540 


TGCCCACTTG 


GCAGTACATC 


AAGTGTATCA 


TATGCCAAGT 


ACGCCCCCTA 


TTGACGTCAA 


600 


TGACGGTAAA 


TGGCCCGCCT 


GGCATTATGC 


CCAGTACATG 


ACCTTATGGG 


ACTTTCCTAC 


660 


TTGGCAGTAC 


ATCTACGTAT 


TAGTCATCGC 


TATTACCATG 


GTGATGCGGT 


TTTGGCAGTA 


720 


CATCAATGGG 


CGTGGATAGC 


GGTTTGACTC 


ACGGGGATTT 


CCAAGTCTCC 


ACCCCATTGA 


780 


CGTCAATGGG 


AGTTTGTTTT 


GGCACCAAAA 


TCAACGGGAC 


TTTCCAAAAT 


GTCGTAACAA 


840 


CTCCGCCCCA 


TTGACGCAAA 


TGGGCGGTAG 


GCGTGTACGG 


TGGGAGGTCT 


ATATAAGCAG 


900 


AGCTCGTTTA 


GTGAACCGTC 


AGATCGCCTG 


GAGACGCCAT 


CCACGCTGTT 


TTGACCTCCA 


960 


TAGAAGACAC 


CGGGACCGAT 


CCAGCCTCCG 


CGGCCGGGAA 


CGGTGCATTG 


GAACGCGGAT 


1020 


TCCCCGTGCC 


AAGAGTGACG 


TAAGTACCGC 


CTATAGAGTC 


TATAGGCCCA 


CCCCCTTGGC 


1080 


TTCTTATGCA 


TGCTATACTG 


TTTTTGGCTT 


GGGGTCTATA 


CACCCCCGCT 


TCCTCATGTT 


1140 


ATAGGTGATG 


GTATAGCTTA 


GCCTATAGOT 


GTGGGTTATT 


GACCATTATT 


GACCACTCCC 


1200 


CTATTGGTGA 


CGATACTTTC 


CATTACTAAT 


CCATAACATG 


GCTCTTTGCC 


ACAACTCTCT 


1260 


TTATTGGCTA 


TATGCCAATA 


CACTGTCCTT 


CAGAGACTGA 


CACGGACTCT 


GTATITITAC 


1320 


AGGATGGGGT 


CTCATTTATT 


ATTTACAAAT 


TCACATATAC 


AACACCACCG 


TCCCCAGTGC 


1380 


CCGCAGTTTT 


TATTAAACAT 


AACGTGGGAT 


CTCCACGCGA 


ATCTCGGGTA 


CGTGTTCCGG 


1440 


ACATGGGCTC 


TTCTCCGGTA 


GCGGCGGAGC 


TTCTACATCC 


GAGCCCTGCT 


CCCATGCCTC 


1500 


CAGCGACTCA 


TGGTCGCTCG 


GCAGCTCCTT 


GCTCCTAACA 


GTGGAGGCCA 


GACTTAGGCA 


1560 


CAGCACGATG 


CCCACCACCA 


CCAGTGTGCC 


GCACAAGGCC 


GTGGCGGTAG 


GGTATGTGTC 


1620 


TGAAAATGAG 


CTC GGGGAGC 


GGGCTTGCAC 


CGCTGACGCA 


TTTGGAAGAC 


TTAAGGCAGC 


1680 


GGCAGAAGAA 


GATGCAGGCA 


GCTGAGTTGT 


TGTGTTCTGA 


TAAGAGTCAG 


AGGTAACTCC 


1740 
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CTGTTAACGG 


TGGAGGGCAG 


TGTAGTCTGA 


GCAGTACTCG 


TTGCTGCCGC 


: 8 o o 


GCGCGCCACC 


AGACATAATA 


GCTGACAGAC 


TAACAGACTG 


TTCCTTTCCA 


TGGGTCTTTT 


13 sc 


CTGCAGTCAC 


CGTCCTTAGA 


TCTGCTGTGC 


CTTCTAGTTG 


CCAGCCATCT 




1920 


CCTCCCCCGT 


GCCTTCCTTG 


ACCCTGGAAG 


GTGCCACTCC 


CACTGTCCTT 


TCCTAATAAA 


1930 


ATGAGGAAAT 


TGCATCGCAT 


TGTCTGAGTA 


GGTGTCATTC 


TATTCTGGGG 


GGTGGGGTGG 


2 0 4 0 


GGCAGCACAG 


CAAGGGGGAG 


GATTGGGAAG 


ACAATAGCAG 


GCATGCTGGG 


GATGCGGTGG 


J i 0 c 


GCTCTATGGG 


TACCCAGGTG 


CTGAAGAATT 


GACCCGGTTC 


CTCCTGGGCC 


AGAAAGAAGC 


1160 


AGGCACATCC 


CCTTCTCTGT 


GACACACCCT 


GTCCACGCCC 


CTGGTTCTTA 


GTTCCAGCCC 


:22 0 


CACTCATAGG 


ACACTCATAG 


CTCAGGAGGG 


CTCCGCCTTC 


AATCCCACCC 


GCTAAAGTAC 


:-2sq 


TTGGAGCGGT 


CTCTCCCTCC 


CTCATCAGCC 


CACCAAACCA 


AACCTAGCCT 


CCAAGAGTGG 


23 4 0 


GAAGAAATTA 


AAGCAAGATA 


GGC TATTAAG 


TGCAGAGGGA 


GAGAAAATGC 


CTCCAACATG 


2400 


TGAGGAAGTA 


ATGAGAGAAA 


TCATAGAATT 


TCTTCCGCTT 


CCTCGCTCAC 


TGACTCGCTG 


2460 


CGCTCGGTCG 


TTCGGCTGCG 


GCGAGCGGTA 


TCAGCTCACT 


CAAAGGCGGT 


AATACGGTTA 


2520 


TCCACAGAAT 


C AGGGG AT AA 


CGCAGGAAAG 


AACATGTGAG 


CAAAAGGCCA 


GCAAAAGGCC 


2580 


AGGAACCGTA 


AAAAGGCCGC 


GTTGCTGG CG 


TTTTTCCATA 


GGCTCCGCCC 


CCCTGACGAG 


2 64 0 


CATCACAAAA 


ATCGACGCTC 


AAGTCAGAGG 


TGGCGAAACC 


CGACAGGACT 


ATAAAGATAC 


2"Q0 


CAGGCGTTTC 


CCCCTGGAAG 


CTCCCTCGTG 


CGCTCTCCTG 


TTCCGACCCT 


GCCGCTTACC 


27 6 0 


GGATACCTGT 


CCGCCTTTCT 


CCCTTCGGGA 


AGCGTGGCGC 


TTTCTCAATG 


CTCACGCTGT 


2820 


AGGTATCTCA 


GTTCGGTGTA 


GGTCGTTCGC 


TCCAAGCTGG 


GCTGTGTGCA 


CGAACCCCCC 


2880 


GTTCAGCCCG 


ACCGCTGCGC 


CTTATCCGGT 


AACTATCGTC 


TTGAGTCCAA 


CCCGGTAAGA 


294C 


CACGACTTAT 


CGCCACTGGC 


AGCAGCCACT 


GGTAACAGGA 


TTAGCAGAGC 


GAGGTATGTA 


3 000 


GGCGGTGCTA 


CAGAGTTCTT 


GAAGTGGTGG 


CCTAACTACG 


GCTACACTAG 


AAGGACAGTA 


3 060 


TTTGGT ATC T 


GCGCTCTGCT 


GAAGCCAGTT 


ACCTTCGGAA 


AAAGAGTTGG 


TAGCTCTTGA 


3120 


TCCGGCAAAC 


AAACCACCGC 


TGGTAGCGGT 


GGTTT'I'TTTG 


TTTGCAAGCA 


GCAGATTACG 


3 180 


CGCAGAAAAA 


AAGGATCTCA 


AGAAGATCCT 


TTGATCTTTT 


CTACGGGGTC 


TGACGCTCAG 


3240 


TGGAACGAAA 


ACTCACGTTA 


AGGGATTTTG 


GTCATGAGAT 


TATCAAAAAG 


GATCTTCACC 


3300 


TAGATCCTTT 


TAAATTAAAA 


ATGAAGTTTT 


AAATCAATCT 


AAAGTATATA 


TGAGTAAACT 


33bO 


TGGTCTGACA 


GTTACCAATG 


CTTAATCAGT 


GAGGCACCTA 


TCTCAGCGAT 


CTGTCTATTT 


3420 
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CGTTCA7CCA TAG7TCCC 



AC7CCCCG77 G7GTAGA7AA CTACGA7ACG GGAGG( 



3 43 j 



CCATCTGGCr CCAGTGCTGC AATGATACCG CGAGACCCAC GCTCACCGGC TCCAGATTTA 



GCCTCCATCC 


AGTCTATTAA 


TTGTTGCCGG 


GAAGCTAGAG 


TAAGTAGTTC 


GCCAGTTAAT 


3660 


AG7TTGCGCA 


ACGTTGTTGC 


CATTGCTACA 


GGCATCGTGG 


TGTCACGCTC 


GTCGTTTGGT 


372 C 


ATGGCTTCAT 


TCAGCTCCGG 


TTCCCAACGA 


TCAAGGCGAG 


TTACATGATC 


CCCCATGTTG 


3 7 S 0 


TGCAAAAAAG 


CGGTTAGCTC 


CTTCGGTCCT 


CCGATCGTTG 


TCAGAAGTAA 




"3 B a r 
5 O 3 U 


GTGTTATCAC 


TCATGGTTAT 


GGCAGCACTG 


CATAATTCTC 


TTACTGTCAT 


GCCATCCGTA 


3 90 0 


AGATGCTTTT 


CTGTGACTGG 


TGAGTACTCA 


ACCAAGTCAT 


TCTGAGAATA 


GTGTATGCGG 


3960 


CGACCGAGTT 


GCTCTTGCCC 


GGCGTCAATA 


CGGGATAATA 


CCGCGCCACA 


TAGCAGAACT 


4020 


TTAAAAGTGC 


TCATCATTGG 


AAAACGTTCT 


TCGGGGCGAA 


AACTCTCAAG 


GATCTTACCG 


4080 


CTGTTGAGAT 


CCAGTTCGAT 


GTAACCCACT 


CGTGCACCCA 


ACTGATCTTC 


AGCATCTTTT 


4140 


ACTTTCACCA 


GCGTTTCTCG 


GTGAGCAAAA 


ACAGGAAGGC 


AAAATGCCGC 


AAAAAAGGGA 


4200 


ATAAGGGCGA 


CACGG AAA TG 


TTGAATACTC 


ATACTCTTCC 


TTTTTCAATA 


TTATTGAAGC 


4260 


ATTTATCAGG 


GTTATTGTCT 


CATGAGCGGA 


TACATATTTG 


AATG T ATTTA 


GAAAAATAAA 


43 2 0 


CAAATAGGGG 


TTCCGCGCAC 


ATTTCCCCGA 


AAAGTGCCAC 


CTGACGTCTA 


AGAAACCATT 


4380 


ATTATCATGA 


CATTAACCTA 


TAAAAATAGG 


CGTATCACGA 


GGCCCTTTCG 


TC 


4432 


(2) INFORMATION FOR SEQ ID NO: 13: 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2196 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : both 



(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



TCAGCAATAA ACCAGCCAGC CGGAAGGGC 



'C GAGCGCAGAA GTGGTCCTGC AACTTTATCC 



3 50 0 



(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
ATTGGCTATT GGCCATTGCA TACGTTGTAT CCATATCATA ATATGTACAT TTATATTGGC 
TCATGTCCAA CATTACCGCC ATGTTGACAT TGATTATTGA CTAGTTATTA ATAGTAATCA 



60 
120 
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ATTACGGGGT 


CAT7AGTTCA 


TAGCCCATAT 


ATGGAGTTCC 


GCGTTACATA 


ACTTACGG7A 


1 3 C 


AATGGCCCGC 


CTGGCTGACC 


GCCCAACGAC 


CCCCGCCCAT 


TGACGTCAAT 


AATGACGTAT 


240 


GTTCCCATAG 


TAACGCCAAT 


AGGGACTT'TC 


CATTGACGTC 


AATGGGTGGA 


GTATTTACGG 


3 00 


TAAAC TGCCC 


ACTTGGCAGT 


ACATCAAGTG 


TATCATATGC 


CAAGTACGCC 


CCCTATTGAC 


360 


GTCAATGACG 


GTAAATGGCC 


CGCCTGGCAT 


TATGCCCAGT 


ACATGACCTT 


ATGGGACTTT 


420 


CCTACTTGGC 


AGTACATCTA 


CGTATTAGTC 


ATCGCTATTA 


CC ATGGTGAT 


GCGGTTTTGG 


4 30 


CAGTACATCA 


ATGGGCGTGG 


ATAGCGGTTT 


GACTCACGGG 


GATTTCCAAG 


TCTCCACCCC 


540 


ATTGACGTCA 


ATGGGAGTTT 


GTTTTGGCAC 


CAAAATCAAC 


GGGACTTTCC 


AAAATGTCGT 


60 0 


AACAACTCCG 


CCCCATTGAC 


GCAAATGGGC 


GGTAGGCGTG 


TACGGTGGGA 


GGTCTATATA 


66 0 


AGCAGAGCTC 


GTTTAGTGAA 


CCGTCAGATC 


GCCTGGAGAC 


GCCATCCACG 


CTGTTTTGAC 


^20 


CTCCATAGAA 


GACACCGGGA 


CCGATCCAGC 


CTCCGCGGCC 


GGGAACGGTG 


CATTGGAACG 


78 0 


CGGATTCCCC 


GTGCCAAGAG 


TGACGTAAGT 


ACCGCCTATA 


GAGTCTATAG 


GCCCACCCCC 


840 


TTGGCTTCTT 


ATGCATGCTA 


TACTGTTTTT 


GGCTTGGGGT 


CTATACACCC 


CCGCTTCCTC 


9 00 


ATGTTATAGG 


TGATGGTATA 


GCTTAGCCTA 


TAGGTGTGGG 


TTATTGACCA 


TTATTGACCA 


960 


CTCCCCTATT 


GGTGACGATA 


CTTTCCATTA 


CTAATCCATA 


ACATGGCTCT 


TTGCCACAAC 


10 20 


TCTCTTTATT 


GGCTATATGC 


CAATACACTG 


TCCTTCAGAG 


ACTGACACGG 


ACTCTGTATT 


1080 


TTTACAGGAT 


GGGGTCTCAT 


TTATTATTTA 


CAAATTCACA 


TATACAACAC 


CACCGTCCCC 


1140 


AGTGCCCGCA 


GTTTTTATTA 


AACATAACGT 


GGGATCTCCA 


CGCGAATCTC 


GGGTACGTGT 


1200 


TCCGGACATG 


GGCTCTTCTC 


CGGTAGCGGC 


GGAGCTTCTA 


CATCCGAGCC 


CTGCTCCCAT 


1260 


GCCTCCAGCG 


ACTCATGGTC 


GCTCGGCAGC 


TCCTTGCTCC 


TAACAGTGGA 


GGCCAGACTT 


1320 


AGGCACAGCA 


CGATGCCCAC 


CACCACCAGT 


GTGCCGCACA 


AGGCCGTGGC 


GGTAGGGTAT 


1380 


GTGTCTGAAA 


ATGAGCTCGG 


GGAGCGGGCT 


TGCACCGCTG 


ACGCATTTGG 


AAGACTTAAG 


1440 


GCAGCGGCAG 


AAGAAGATGC 


AGGCAGCTGA 


GTTGTTGTGT 


TCTGATAAGA 


GTCAGAGGTA 


1500 


ACTCCCGTTG 


CGGTGCTGTT 


AACGGTGGAG 


GGCAGTGTAG 


TCTGAGCAGT 


ACTCGTTGCT 


1560 


GCCGCGCGCG 


CCACCAGACA 


TXATAGCTGA 


CAGACTAACA 


GACTGTTCCT 


TTCCATGGGT 


1620 


CTTTTCTGCA 


GTCACCGTCC 


TTAGATCTGC 


TGTGCCTTCT 


AGTTGCCAGC 


CATCTGTTGT 


1680 


TTGCCCCTCC 


CCCGTGCCTT 


CCTTGACCCT 


GG AAGGTGCC 


ACTCCCACTG 


TCCTTTCCTA 


1740 


ATAAAATGAG 


GAAATTGCAT 


CGCATTGTCT 


GAGTAGGTGT 


CATTCTATTC 


TGGGGGGTGG 


1800 
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GGTGGGGCAG CACAGCAAGG GGGAGGATTG GGAAGACAAT AGCAGGCATG CTGGGGATGC 13 5; 

GGTGGGCTCT ATGGGTAGCC AGCTGCTGAA GAATTGACCC GGTTCCTCCT GGGCCAGAAA 
GAAGCAGGCA CATCCCCTTC TCTGTGACAC ACCCTGTCCA GGCCCCTGGT TCTTAGTTCC 
A3CCCCACTC ATAGGACACT CATAGCTCAG GAGGGCTCCG CCTTCAATCC CAGCCGCTAA 204 C 
AGTACTTGGA GCGGTCTCTC CCTCCCTCAT CAGCCCACCA AACCAAACCT ACCCTCCAAG 
AGTGGGAAGA AATTAAAGCA AGATAGGCTA TTAAGTGCAG AGGGAGAGAA AATGCCTCCA 
ACATGTGAGG AAGTAATGAG AGAAATCATA GAATTC 
(2) INFORMATION FOR SEQ ID NO: 14: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4864 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
Ciii) HYPOTHETICAL: NO 
(IV) ANTI -SENSE : NO 



192C 
i960 



:■ 1 0 C 
1160 
2196 



ixi} SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

TCGCGCGTTT CGGTGATGAC GGTGAAAACC TCTGACACAT GCAGCTCCCG GAGACGGTCA 60 

CAGCTTGTCT GTAAGCGGAT GCCGGGAGCA GACAAGCCCG TCAGGGCGCG TCAGCGGGTG 12 0 

TTGGCGGGTG TCGGOGCTGG CTTAACTATG CGGCATCAGA GCAGATTGTA CTGAGAGTGC 180 
ACCATATGCG GTGTGAAATA CCGCACAGAT GCGTAAGGAG AAAATACCGC ATCAGATTGG 
CTATTGGCCA TTGCATACGT TGTATCCATA TCATAATATG TACATTTATA TTGGCTCATG 

TCCAACATTA CCGCCATGTT GACATTGATT ATTGACTAGT TATTAATAGT AATCAATTAC 3 60 

GGGGTCATTA GTTCATAGCC CATATATGGA GTTCCGCGTT ACATAACTTA CGGTAAATGG 42 C 

CCCGCCTGGC TGACCGCCCA ACGACCCCCG CCCATTGACG TCAATAATGA CGTATGTTCC 48 0 

CATAGTAACG CCAATAGGGA CTTTCCATTG ACGTCAATGG GTGGAGTATT TACGGTAAAC 54 0 

TGCCCACTTG GCAGTACATC AAGTGTATCA TATGCCAAGT ACGCCCCCTA TTGACGTCAA 60 0 
TGACGGTAAA TGGCCCGCCT GGCATTATGC CCAGTACATG ACCTTATGGG ACTTTCCTAC 
TTGGCAGTAC ATCTACGTAT TAGTCATCGC TATTACCATG GTGATGCGGT TTTGGCAGTA 



240 

300 



660 

720 
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CATCAAT3GG CGTGGATAGC GGTTTGACTC ACGGGGATTT CCAAGTCTCC ACCCCATTGA 7 3C 

CGTCAAT jGG AGTTTGTTTT GGCACCAAAA TCAACGGGAC TTTCCAAAAT GTCGTAACAA 34 C 

ctccgcczca ttgacgcaaa tgggcggtag gcgtgtacgg tgg-3aggtct atataagcag 90 0 

agctcgttta gtgaaccgtc agatcgcctg gagacgccat cca igctgtt ttgacctcca 9 6g 

tagaaga:ac cggga;cgat ccagcctccg cggccgggaa cggtgcattg gaacgcggat io:o 

tccccqt^cc aagagtgacg taagtaccgc ctatagagtc tataggccca cccccttggc 10&0 

ttcttatgca tgctatactg tttttggctt ggggtctata cac2cccgct tcctcatgtt 114 0 

ATAGGTGATG GTATAGCTTA GCCTATAGGT GTGGGTTATT GAC3ATTATT GACCACTCCC 12 0 0 

CTATTGGTGA CGATACTTTC CATTACTAAT CCATAACATG GCTCTTTGCC ACAACTCTCT 12 6 0 

TTATTGGCTA TATGCCAATA CACTGTCCTT CAGAGACTGA CACGGACTCT GTATTTTTAC 13: 0 

AGGATGGGGT CTCATTTATT ATTTACAAAT TCACATATAC AACACCACCG TCCCCAGTGC 13&0 

CCGCAGTTTT TATTAAACAT AACGTGGGAT CTCCACGCGA ATCTCGGGTA CGTGTTCCGG 144 0 

ACATGGGCTC TTCTCCGGTA GCGGCGGAGC TTCTACATCC GAGCCCTGCT CCCATGCCTC 150 0 

CAGCGACTCA TGGTCGCTCG GCAGCTCCTT GCTCCTAACA GTGGAGGCCA GACTTAGGCA 1560 

CAGCACGATG CCCACCACCA CCAGTGTGCC GCACAAGGCC GTGGCGGTAG GGTATGTGTC 167.0 

TGAAAATGAG CTCGGGGAGC GGGCTTGCAC CGCTGACGCA TTTGG AAG AC TTAAGGCAGC 168C 

GGCAGAAGAA GATGCAGGCA GCTGAGTTGT TGTGTTCTGA TAAGAGTCAG AGGTAACTC ■: 17 4 0 

CGTTGCGGTG CTGTTAACGG TGGAGGGCAG TGTAGTCTGA GCAGTACTCG TTGCTGCCGC 1 8 Cf 0 

GCGCGCCACC AGACATAATA GCTGACAGAC TAACAGACTG TTCCTTTCCA TGGGTCTTTT 186 0 

CTGCAGTCAC CGTCCTTAGA TCTGCTGTGC CTTCTAGTTG CCAGCCATCT GTTGTTTGCC 1920 

CCTCCCCCGT GCCTTCCTTG ACCCTGGAAG GTGCCACTCC CACTGTCCTT TCCTAATAAA 19 8 0 

ATGAGGAAAT TGCATCGCAT TGTCTGAGTA GGTGTCATTC TATTCTGGGG GGTGGGGTGG 20 4 0 

GGCAGCACAG CAAGGGGGAG GATTGGGAAG ACAATAGCAG GCATGCTGGG GATGCGGTGG 210 0 

GCTCTATGGG TACCCAGGTG CTGAAGAATT GACCCGGTTC CTCCTGGGCC AGAAAGAAGC 2160 

AGGCACATCC CCTTCTCTGT GACACACCCT GTCCACGCCC CTGGTTCTTA GTTCCAGCCC 2220 

CACTCATAGG ACACTCATAG CTCAGGAGGG CTCCGCCTTC AATCCCACCC GCTAAAGTAC 2280 

TTGGAGCGGT CTCTCCCTCC CTCATCAGCC CACCAAACCA AACCTAGCCT CCAAGAGTGG 2 3 40 

GAAGAAATTA AAGCAAGATA GGC T ATT AAG TGCAGAGGGA GAGAAAATGC CTCCAACATG 2 40 0 



W O 95/24485 



PCT7US95/02633 



TGAGGAAGTA A TG AG AG AAA TCATAGAATT TCTTCCGC7T CCTCGCTCAC TGACTCGCTG 24 60 

CGCTCGGTGG TTC3GCTGCG GCGAGCGGTA TCAGCTCACT CAAAGGCGGT AATACGGTTA 252 C 

TCGAGAGAAT CAGGGGATAA CGCAGGAAAG AACATGTGAG CAAAAGGCCA GCAAAAGGCC 2 58 0 

AGGAACCGTA AAAAGGCCGC GTTGCTGGCG TTTTTCCATA GGCTCCGCCC CCCTGACGAG 2 64 0 

CATCAGAAAA ATCGACGCTC AAGTCAGAGG TGGCGAAACC CGACAGGACT ATAAAGATAC 27 OC 

CAGGCGTTTC CCCCTCGAAG CTCCCTCGTC CGCTCTCCTG TTCCGACCCT GCCGCTTACC 27 6 C 

GGATACGTGT CCGCCTTTCT CCCTTCGGGA AGCGTGGCGC TTTCTCAATG CTCACGCTGT 2 82 0 

AGGTATCTCA GTTCGGTGTA GGTCGTTCGC TCCAAGCTGG GCTGTGTGCA CGAACCCCCC 288 0 

GTTCAGCCCG ACCGCTGCGC CTTATCCGGT AACTATCGTC TTGAGTCCAA CCCGGTAAGA 294 0 

CACGACTTAT CGCCACTGGC AGCAGCCACT GGTAACAGGA TTAGCAGAGC GAGGTATGTA 3 00 0 

GGCG3TGCTA CAGAGTTCTT GAAGTGGTGG CCTAACTACG GCTACACTAG AAGGACAGTA 3 0 60 

TTTGGTATCT GCGCTCTGCT GAAGCCAGTT ACCTTCGGAA AAAGAGTTGG TAGCTCTTGA 312 0 

TCCGGCAAAC AAACCACCGC TGGTAGCGGT GGTTTTTTTG TTTGCAAGCA GCAGATTACG 3180 

CGCAGAAAAA AAGGATCTCA AGAAGATCCT TTGATCTTTT CTACGGGGTC TGACGCTCAG 3 24 0 

TGGAACGAAA ACTCACGTTA AGGGATTTTG GTCATGAGAT TATCAAAAAG GATCTTCACC 3 300 

TAGATCCTTT TAAATTAAAA ATGAAGTTTT AAATCAATCT AAAGTATATA TGAGTAAACT 3 3 60 

TGGTCTGACA GTTACCAATG CTTAATCAGT GAGGCACCTA TCTCAGCGAT CTCTCTATTT 3 42 0 

CGTTCATCCA TAGTTGCCTG ACTCCGGGGG GGGGGGGCGC TGAGGTCTGC CTCGTGAAGA 3 48 0 

AGGTGTTGCT GACTCATACC AGGCCTGAAT CGCCCCATCA TCCAGCCAGA AAGTG AGGG A 3 54 0 

GCCACGGTTG ATGAGAGCTT TGTTGTAGGT GGACCAGTTG GTGATTTTGA ACTTTTGCTT 3 60 0 

TGCCACGGAA CGGTCTGCGT TGTCGGGAAG ATGCGTGATC TGATCCTTCA ACTCAGCAAA 3 66 0 

AGTTCGATTT ATTCAACAAA GCCGCCGTCC CGTCAAGTCA GCGTAATGCT CTGCCAGTGT 37 2 0 

TACAACCAAT TAACCAATTC TGATTAGAAA AACTCATCGA GCATCAAATG AAACTGCAAT 37 80 

TTATTCATAT CAGGATTATC AATACCATAT TTTTGAAAAA GCCGTTTCTG TAATGAAGGA 3 84 0 

GAAAACTCAC CGAGGCAGTT CCATAGGATG GC AAGATCCT GGTATCGGTC TGCGATTCCG 3 90 0 

ACTCGTCCAA CATCAATACA ACCTATTAAT TTCCCCTCGT CAAAAATAAG GTTATCAAGT 3 9 60 

GAGAAATCAC CATGAGTGAC GACTGAATCC GGTGAGAATG GCAAAAGCTT ATGCATTTCT 4 02 0 

TTCCAGACTT GTTCAACAGG CCAGCCATTA CGCTCGTCAT CAAAATCACT CGCATCAACC 4 08 0 
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414: 

4 2 00 
4260 
4320 



AAACCGTTA? TCATTCGTGA TTGCGCCTGA GCGAGACGAA ATACGCGATC GCTGTTAAAA 
GGACAATTAC AAACAGGAAT CGAATGCAAC CGGCGCAGGA ACACTGCCAG CGCATCAACA 
ATATTTTCAC CTGAATCAGG ATATTCTTCT AATACCTGGA ATGCTGTTTT CCCGGGGATC 
GCAGTGGTGA GTAACCATGC ATCATCAGGA GTACGGATAA AATGCTTGA7 GGTCGGAAGA 
GGCATAAATT CCGTCAGCCA GTTTAGTCTG ACCATCTCAT CTGTAACATC ATTGGCAACG 43 SC 

CTACCTTTGC CATGTTTCAG AAACAACTCT GGCGCATCGG GCTTCCCATA CAATCGATAG 44 4 0 

ATTGTCGCAC CTGATTGCCC GACATTATCG CGAGCCCATT TATACCCATA TAAATCAGCA 45 0 0 
TCCATGTTGG AATTTAATCG CGGCCTCGAG CAAGACGTTT CCCGTTGAAT ATGGCTCATA 45 60 

ACACCCCTTG TATTACTGTT TATGTAAGCA GACAGTTTTA TTGTTCATGA TGATATATTT 4 620 
TTATCTTGTG CAATGTAACA TCAGAGATTT TGAGACACAA CGTGGCTTTC CCCCCCCCCC 4 6 80 

CATTATTGAA GCATTTATCA GGGTTATTG? CTCATGAGCG GAT AC AT ATT TGAATGTATT 4 74 0 

TAGAAAAATA AACAAATAGG GGTTCCGCGC ACATTTCCCC GAAAAGTGCC ACCTGACGTC 4 80 0 

TAAGAAACCA TTATTATCAT GACATTAACC TATAAAAATA GGCGTATCAC GAGGCCCTTT 4860 
CGTC 

(2) INFORMATION FOR SEQ ID NO : 1 5 : 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 41 base pairs 
CB) TYPE: nucleic acid 
CO STRANDEDNESS: both 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 



4864 



(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CCACATAGAT CTGTTCCATG GTTGTGGCAA TATTATCATC G 
(2) INFORMATION FOR SEQ ID NO : 1 6 : 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 
!C) STRANDEDNESS: both 
(D) TOPOLOGY: linear 
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i:> MOLECULE TYPE: cDNA 
in; HYPOTHETICAL : NO 
!iv; ANTI -SENSE: NO 



(xi ; SEQUENCE DESCRIPTION: SEC ID NO : 1 6 : 
GGTACAAGAT CTACCATGGC AGGAAGAAGC GGAGACAGC 
(2! INFORMATION FOR SEQ ID NO: 17; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 
( C } STRANDEDNES S : bo t h 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE : cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17 : 
CC AC AT AG AT CTGATATCGC ACTATTCTTT AGCTCCTGAC TCC 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY : linear 

(ll) MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18; 
GATCACCATG GATGCAATGA AGAGAGGGCT CTGCTGTGTG CTGCTGCTGT GTGGAGCAGT 
CTTCGTTTCG CCCAGCGA 
(2; INFORMATION FOR SEQ ID NO : 1 9 : 
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; i SEQUENCE CHARACTERISTICS : 
■A; LENGTH: 7 8 base pairs 
(3: TYPE: nucleic acid 
: Z : STRANDEDNESS : both 
( D ) TOPOLOGY: linear 

'lit MOLECULE TYPE : cDNA 

ill) HYPOTHETICAL : NO 

;:vi ANTI -SENSE: NO 



(xil SEQUENCE DESCRIPTION: SEQ. ID NO: 19: 

GATCTCGCTG GGCGAAACGA AGACTGCTCC ACACAGCAGC AGCACACAGC AGAGCCCTCT 6 0 

CTTCATTGCA TCCATGGT 7 8 
(2) INFORMATION FOR SEQ ID NO: 20: 

li) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GGTACATGAT CAGATATCGC CCGGGCCGAG ATCTTCAGAC TTGGAGGAGG AG 5 2 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: cDNA 
(iii! HYPOTHETICAL: NO 
(iv) ANTI -SENSE : NO 
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ixi; SEQUENCE DESCRIPTION: SEQ ID NO : 2 I : 

CCACATTGA7 CAGCTTGTGT AATTGTTAAT TTCTCTGTCC 

(2; INFORMATION FOR SEQ ID NO: 22: 

li) SEQUENCE CHARACTERISTICS: 
(A! LENGTH: 39 base pairs 
(3) TYPE: nucleic acid 
id STRANDEDNESS: both 
(D) TOPOLOGY: linear 

In) MOLECULE TYPE : cDNA 

(ill) HYPOTHETICAL: NO 

(iv) ANTI-SENSE : NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CCCCGGATCC TGATCACAGA AAAATTGTGG GTCACAGTC 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

( i l ) MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 23: 
CCCCAGGAAT CCACCTGTTA GCGCTTTTCT CTCTGCACCA CTCTTCTC 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: cDNA 



(in) HYPOTHETICAL: NO 
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ANTI -SENSE : NO 



'.XI' SEQUENCE DESCRIPTION: SEQ ID NC;24: 

GGTACATGAT CACAGAAAAA TTGTGGGTCA CAGTC 

2 INFORMATION FOR SEQ ID NO: 25: 

■i> SEQUENCE CHARACTERISTICS: 
(A! LENGTH: 4 7 base pairs 

(B) TYPE: nuclei? acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY : linear 

111) MOLECULE TYPE: cDNA 

HYPOTHETICAL: NO 
(iV; ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

CC AC ATTGAT CAGATATCTT ATCTTTTTTC TCTCTGCACC ACTCTTC $1 

(2) INFORMATION FOR SEQ ID NO: 26: 

(l) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 3 8 base pairs 
!B) TYPE : nucleic acid 

(C) STRANDEDNESS both 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 
(ili) HYPOTHETICAL: NO 
liv! ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
GTCACCGTCC TCTATCAAAG CAGTAAGTAG TACATGCA 
(2) INFORMATION FOR SEQ ID NO: 27: 

li) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 
;C) STRANDEDNESS: both 
(D) TOPOLOGY: linear 
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MCLEC'JLE TYPE : cDNA 
HYPOTHETICAL: NO 
ANTI- SENSE: NC 



I xi) SEQUENCE DESCRIPTION: SEQ. ID NO: 2^: 

T3TA0TACTT ACTGCTTTGA TAGAGGACGG TGACTGCA 

\2\ INFORMATION FOR SEQ ID NO:2£: 

! l ! SEQUENCE CHARACTERISTICS : 
(A; LENGTH: 4 0 base pairs 
(3) TYPE: nucleic acid 
;C; STRANDEDNESS : both 
!D) TOPOLOGY: linear 

ill! MOLECULE TYPE : cDNA 

(Hi) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
GGTACATGAT CAACCATGAG AGTGAAGGAG AAATATCAGC 
(2) INFORMATION FOR SEQ ID NO: 29: 

( l ! SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 

HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

CCACATTGAT CAGATATCCC CATCTTATAG CAAAATCCTT TCC 43 

(2} INFORMATION FOR SEQ ID NO : 3 0 : 

;:■ SEQUENCE CHARACTERISTICS : 
(A* LENGTH : 43 base pairs 
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'3- TYPE: nucleic acid 
C STRANDEDNESS: bot'r. 
E' TOPOLOGY : linear 

ill; MOLECULE TYPE: cDNA 

m: HYPOTHETICAL: NO 

iiv.! ANTI- SENSE: NO 



;xi> SEQUENCE DESCRIPTION: SEQ ID NO : 3 C : 

CCACATTGAT CAGATATCCC CATCTTATAG CAAAATCCTT TCC 

(2) INFORMATION FOR SEQ ID NO : 3 1 : 

il! SEQUENCE CHARACTERISTICS: 
( A } LENGTH: 43 base pairs 
(3) TYPE: nucleic acid 
!C) STRANDEDNESS : both 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(in) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 1 : 
GGTACACTGC AGTCACCGTC CTATGGCAGG AAGAAGCGGA GAC 
(2) INFORMATION FOR SEQ ID NO: 32: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 2 base pairs 

( B ) TYPE : nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
fill) HYPOTHETICAL : NO 
iiv) ANTI-SENSE: NO - 



(Xi? SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CCACATCAGG TACCCCATAA TAGACTGTGA CC 



32 
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INFORMATION FCF SEQ ID NC : 3 3 : 

SEQUENCE CHARACTERISTICS : 
■'.A; LENGTH: 45 base pairs 
;B) TYPE: nucleic acid 
(Ci STRANDEDNESS : both 
(D! TOPOLOGY: linear 

: ii! MOLECULE TYPE: cDNA 

(ill! HYPOTHETICAL: NC 

i iv) AToTI - SENSE : NO 



(XI) SEQUENCE DESCRIPTION : SEQ ID NO: 33: 
GGTACAAGAT CTACCATGGG ACCAGTACAA CAAATAGGTG GTAAC 
12) INFORMATION FOR SEQ ID NO : 3 4 : 

( l ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 37 base pairs 

( B ) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY : linear 

111! MOLECULE TYPE : cDNA 
(in) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
CCACATAGAT CTTTACATTA ATCTAGCCTT CTGTCCC 
(2) INFORMATION FOR SEQ ID NO : 3 5 : 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(in) HYPOTHETICAL: NO 
(iv) ANTI-SENSE : NO 
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.XI : SEQUENCE DESCRIPTION: SEQ ID NO : 3 5 : 

3GTACAACCA TGGGTGGAGC TATTTCCATG AGG 

:2> INFORMATION FOR SEQ ID NO : 3 6 : 

:i] SEQUENCE CHARACTERISTICS : 
(A.) LENGTH: 29 base pairs 
(3) TYPE : nucleic acid 
(C) STRANDEDNESS: both 
;d; TOPOLOGY, linear 

in) MOLECULE TYPE; cDNA 

liiil HYPOTHETICAL: NO 

i iv) ANTI -SENSE : NO 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO : 3 6 : 
CCTAGGTTAG CCTTCTTCTA ACCTCTTCC 

INFORMATION FOR SEQ ID NO: 37: 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 
CD) TOPOLOGY: linear 

( l i ) MOLECULE TYPE : cDNA 

(Hi) HYPOTHETICAL: NO 

(iv) ANTI-SENSE : NO 



(Xl) SEQUENCE DESCRIPTION: SEQ ID NO : 37 : 
GGTACAAGAT CTACCATGGG ATGTCTTGGG AATCAGC 
(2) INFORMATION FOR SEQ ID NO: 38: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

Hi' 1 MOLECULE TYPE : cDNA 



ill' HYPOTHETICAL : NO 
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ANTI -SENSE: NO 

',xi : SEQUENCE DESCRIPTION: SEQ ID NO : 3 8 : 
CCACATAGAT CTGATATCGT ATGAGTCTAC TGGAAATAAG AGG 
■2: INFORMATION FOR SEQ ID NO : 3 9 : 

f i! SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 42 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

lii! MOLECULE TYPE: cDNA 
(lii) HYPOTHETICAL: NO 
iiv) ANTI-SENSE: NO 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO : 3 9 : 
GGTACAGGAT CCACCATGGG TGCGAGAGCG TCAGTATTAA GC 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 
(lii) HYPOTHETICAL: NO 
(IV) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

CCACATGGAT CCGCCCGGGC TTACATCTCT GTACAAATTT CTACTAATGC 

(2) INFORMATION FOR SEQ ID NO: 41: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 base pairs 
<B) TYPE : nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 
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.ll' MOLECULE TYPE: cDNA 
ill HYPOTHETICAL: NC 
:iv; ANT I -SENSE; NO 



!xu SEQUENCE DESCRIPTION: SEQ ID NO : 4 1 : 

GGTACAGGAT CCCCGCACGG CAAGAGGCGA GGG 

:2> INFORMATION FOR SEQ ID NO: 42: 

ii) SEQUENCE CHARACTERISTICS: 
(A.) LENGTH: 4 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNFSS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(lii) HYPOTHETICAL: NO 
I iv) ANTI-SENSE: NO 



fxi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
GGTACAGGAT CCACCATGGC TGCGAGAGCG TCAGTATTAA GC 
(2) INFORMATION FOR SEQ ID NO: 43: 

(l) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

CCACATGGAT CCGCCCGGGC CTTTATTGTG ACGAGGGGTC GTTGC 45 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i! SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 3 3 base pairs 
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I3-' TYPE r nucleic acid 

■■ z strandedness : both 

[Z- TCPCLCGY: linear 
fii' MOLECULE TYPE : cDNA 
ill) HYPOTHETICAL: NO 
'iv ANT I -SENSE: NO 



ixi: SEQUENCE DESCRIPTION : SEQ ID NO: 44: 
GGTACAGGAT CCCCGCACGG CAAGAGGCGA GGG 
\2) INFORMATION FOR SEC ID NO : 4 5 : 

ii! SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 42 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS - both 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE : cDNA 
(ill) HYPOTHETICAL: NO 
(iv! ANT I -SENSE : NO 



{xi I SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
GGTACAGGAT CCACCATGGC TGCGAGAGCG TCAGTATTAA GC 
(2) INFORMATION FOR SEQ ID NO: 46: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

!ll) MOLECULE TYPE: cDNA 
( i i l ) HYPOTHETICAL : NO 
iiv] ANTI-SENSE: NO 



'Xll SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
GTACCTCATG AGCCACATAA TACCATG 



27 
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2' INFORMATION FOR SEQ ID NO: 47: 

i: SEQUENCE CHARACTERISTICS : 
A 40 base pairs 

(B) TYPE: nucleic acid 
■'CI STRANDEDNESS: both 
<D! TOPOLOGY : linear 

Hi) MOLECULE TYPE: cDNA 

(in! HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 



ixi) SEQUENCE DESCRIPTION: SEQ ID NC:47: 
GGTACAAGAT CTACCATGGC TTGCAATTGT CAGTTGATGC 4 0 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv> ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
CC AC AT AG AT CTCCATGGGA ACTAAAGGAA GACGGTCTGT TC 
(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
iivi ANTI -SENSE: NO 
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x: ' SEQUENCE DESCRIPTION: SEQ ID NO : 4 9 : 

G37ACAAGA? CTACCATGAA GGCAAACCTA CTG3TCCTG 

2 IMFORMATI ON FOR SEQ ID NO: 50: 

I i ; SEQUENCE CHARACTERISTICS : 
(A) LENGTH : 46 base pairs 
(BJ TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

:il) MOLECULE TYPE: cDNA 

HYPOTHETICAL * NO 
il v) ANT I -SENSE : NO 



ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
CC AC ATAGAT CTGATATCCT AATCTCAGAT GCATATTCTG CACTGC 
(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(il) MOLECULE TYPE: peptide 
till) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Arg He His He Gly Pro Gly Arg Ala Phe Tyr Thr Thr Lys Asn 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : hot h 

(D) TOPOLOGY, linear 

(ii) MOLECULE TYPE* cDNA 
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■:ivi Aim- SENSE: NC 

(Xl i SEQUENCE DESCRIPTION: SEQ ID NO: 52 

GAAAGAGCAG AAGACAGTGG CAATGA 

■2; INFORMATION FOR SEQ ID NO : 5 3 : 

( i ; SEQUENCE CHARACTERISTICS : 
(A! LENGTH: 40 base pairs 
■;B? TYPE: nucleic acid 
(C? STRANDEDNESS : both 
<D) TOPOLOGY : linear 

(ii) MOLECULE TYPE : cDNA 

(in) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 
GGGCTTTGCT AAATGGGTGG CAAGTGGCCC GGGCATGTGG 
(2! INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(iit MOLECULE TYPE: cDNA 

HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 
CTAACAGACT GTTCCTTTCC ATG 
(2) INFORMATION FOR SEQ ID NO: 55: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 
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S7RANEECNE53 : both 
70F0L3G7: linear 



MOLECULE TYPE: cDNA 



ill 



HYPOTHETICAL 



MO 



IS- 



ANTI -SENSE: fiC 



SEQUENCE DESCRIPTION: SECj ID NO: 55: 



GGAGTGGCAC CTTCCAGG 



IS 



INFORMATION FOR SEC ID NO: 56: 

(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 
!D) TOPOLOGY: both 

MOLECULE TYPE : cDNA 

(lii) HYPOTHETICAL: NO 

(lv) ANTI -SENSE: NO 



(xi ! SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
GGAGACAGCG ACGAAGACCT CCTCAAGGCA GTCAGACTCA TCAAG 4 5 

(21 INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(ili) HYPOTHETICAL : NO 
[jvl ANTI-SENSE: NO 



(xil SEQUENCE DESCRIPTION: SEQ ID NO: 57; 
GATGGCTGGC AACTAGAAGG CACAGCAGAT CTGATATCGC ACTATTCTTT AGCTCCTGAC 6 0 

TCCAATATTG T -m 
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information fcp sec id no:53: 

ii; SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 base pairs 
IB) TYPE: nucleic acid 
iC) STRANDEDNESS : both 
■D> TOPOLOGY; both 

in.) MOLECULE TYPE : cDNA 

(ill ) HY POTHET I CAL : NO 

(iv; ANTI -SENSE: NO 



ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
CTTAGATCAA CCATGAGAGT GAAGGAGAAA TAT C AG C ACT TGTGGAGATG GGGGTGGAGA 60 
TGGGGCACCA TGCTCCTTGG GATGTTGATG ATCTGTAGTG CTACAGAAAA ATTGTGGGT 119 
[2] INFORMATION FOR SEQ ID NO: 59: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 8 base pairs 

(B) TYPE: nucleic acid 
(CJ STRANDEDNESS : both 
(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

(ill) HYPOTHETICAL: NO 

(lv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

CTGGCAACTA GAAGGCACAG CAGATCAGAT AGTGTCCCCA TCTTATAGCA AAATCCTTTC 

CAAGCCCTGT CTTATTCT 

!2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(in) HYPOTHETICAL: NO 
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SEQUENCE DESCRIPTION: SEQ ID NO : 6 

GATTTA3GTG ACACTATAG 

■2' INFORMATION FOR SEC ID NO : 6 1 r 

! l .) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 2 0 base pairs 
IB) TYPE: nucleic acid 
fC) STRANDEDNESS : both 
(Dj TOPOLOGY: both 

in: MOLECULE TYPE: cDNA 

fill) HYPOTHETICAL: NO 

;iv) ANT I -SENSE : NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61 
TAATACGACT CACTATAGGG 
!2; INFORMATION FOR SEQ ID NO: 62: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(li) MOLECULE TYPE: cDNA 
(iiii HYPOTHETICAL: NO 
!iv} ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 62 
CATGCCTGCA GGTCGACTCT AAATTCCG 
(2! INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 
(CI STRANDEDNESS : both 
(D) TOPOLOGY: both 
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i : ■ MOLECULE TYPE : cDNA 
ill) HYPOTHETICAL : NO 
(IV) ANT I- SENSE: NO 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 63: 
ACCCGGGGAT CCTCTAGCGC GCTTGTCTCT TGTTCCA 
(2 1 INFORMATION FOR SEQ ID NO: 64: 

!i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY : both 

(ii) MOLECULE TYPE: cDNA 
(ill) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi} SEQUENCE DESCRIPTION : SEQ ID NO: 64: 
GGGACGTGGT TTTCC 

(2) INFORMATION FOR SEQ ID NC:65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY : both 

(ii) MOLECULE TYPE: cDNA 
(ni) HYPOTHETICAL : NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
TATGGCCACA ACCATGGCAG GAAGAAGCGG AGACAGCGAC GAAGACCTCC TCAAGGCAGT 6 0 

CAGACT 

66 

(2; INFORMATION FOR SEQ ID NO : 66 : 
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; se;"e:;ce characteristics: 

:A- LENGTH: 9 0 base pairs 

;B> TYPE : nucleic acid 

:d STRANDEDNESS: both 

;D! TOPOLOGY: both 

MOLECULE TYPE: cDNA 

(ill) HYPOTHETICAL: NO 

liv; ANTI -SENSE: NO 



!xi! SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
CTCGAGCCAT GGGCCCCTAG ACTATAGCGT GATAAGAAAT CGAGGACTGA GGTTATAACA 6 0 

TCCTCTAAGG TGGTTATAAA CTCCCGAAGG 9C 
(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

ill) MOLECULE TYPE: cDNA 
(ill) HYPOTHETICAL: NO 
!iv) ANTI -SENSE: NO 



(XI) SEQUENCE DESCRIPTION : SEQ ID NO: 67: 
GCTTCGGCCA GTAACG 

(2: INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 87 base pairs 
{B) TYPE: nucleic acid 
(C) STRANDEDNESS: both 
<D) TOPOLOGY: both 

{ii} MOLECULE TYPE: cDNA 

fill> HYPOTHETICAL: NO 



ANTI - SENSE : NO 
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>:i -sr-QUENCE DESCRIPTION: 3EC_ IH NO: 6 5 : 
TTGCATGCCT GCAGGTGGTA CATGATCAGA TATCGCCCGG GCCGAGATCT TCAGACTTGG 6 0 

AGGAGGAGAT ATGAGGGACA ATTGGAG 3^ 
'2: INFORMATION FOR SEp ID NO: 69: 

( ii SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 79 base pairs 

(B) TYPE: nu:leic acid 
;C) STRANDEDNESS: both 
(D) TOPOLOGY: both 

: ii) MOLECULE TYPE: cDNA 

iii) HYPOTHETICAL: NO 

i iv] ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
GGGGCGGAAT TTAGAGTCAA TTGATCAGCT TGTGTAATTG TTAATTTCTC TGTCCCACTC 6 0 

CATCCAGGTC GTGTGATTC 7 9 

■2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

iii) MOLECULE TYPE : CDNA 
(iii) HYPOTHETICAL: NO 
: iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
CCATCTCCAC AAGTGCTG 18 
(2) INFORMATION FOR SEQ ID NO : 7 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 77 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 
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■ : : MOLECULE TYPE : cDNA 
(ill- HYPOTHETICAL: NO 
(iv; ANT I -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 I : 
AGATCTAAGG ACGGTGACTG CATGTACTAC TTACTGCTTT GATAGAGGAC GGTGACTGCA 6 0 

GAAAAGACCC ATGGAAA 

;2) INFORMATION FOR SEQ ID NO : 7 2 : 

{ i ! SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANCEDNESS: both 

(D) TOPOLOGY: both 

(li) MOLECULE TYPE : cDNA 
(ill) HYPOTHETICAL : NO 
(IV) ANT I- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
GCTTCGGCCA GTAACG X £ 
(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 62 base pairs 

(B) TYPE : nucleic acid 

(C) STRAND ED NESS : both 

(D) TOPOLOGY : both 

(li) MOLECULE TYPE: cDNA 
(ill) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 3 : 
GGCACAGCAG ATCAGATGGG GATCTGATAT CGCACTATTC TTTAGCTCCT GACTCCTGAC 6 0 

iC 62 
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2 INFORMATION FOP. 3EQ ID NO: 74: 



U; SEQUENCE CHARACTERISTICS : 
(Aj LENGTH: 42 base pairs 
(B) TYPE: nucleic acid 
JO STRANDEDNESS: both 
(D! TOPOLOGY: both 

MOLECULE TYPE : cDrJA 

ill*; HYPOTHETICAL : NO 

! :v! ANTI -SENSE : NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

GGAATTTGAG TCATCCCCAT CTTATAGCAA AATCCTTTCC AA 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 42 base pairs 
{ B ) TYPE: nucleic acid 
(C) STRANDEDNESS ; both 
<D) TOPOLOGY: both 

Ui; MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE : NO 



fxi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

CTTAGATCCC CGCACGGCAA GAGGCGAGGG GCGGCGACTG GT 

(2) INFORMATION FOR SEQ ID NO: 76: 

ti) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 70 base pairs 
<B} TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY : both 

(ii) MOLECULE TYPE: cDNA 
fill) HYPOTHETICAL: NO 



(iv) ANTI -SENSE : NO 
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SEQUENCE DESCRIPTION: SEQ ID 



GGCACAGCAG ATCCGCCCGG GCTTACATCT CTGTACAAAT TTCTACTAAT GCTTTTATTT 

m rr f ^ ^ *T< f— 



:2! INFORMATION FOR SEQ ID NO: 77: 

(l) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 70 base pairs 
(3] TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

molecule type: cdna 

(iii) HYPOTHETICAL: NC 
(iv! ANTI-SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
CTTAGATCCA CCATGGGTGC GAGAGCGTCA GTATTAAGCG GGGGGAGAAT TAGATCGATG 6 0 

GGAAAAAATT 7 Q 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 70 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: both 
{ D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(IV) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
GGCACAGCAG ATCCGCCCGG GCTTACATCT CTGTACAAAT TTCTACTAAT GCTTTTATTT 
TTCTTCTGTC 

(2) INFORMATION FOR SEQ ID NO: 79: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 118 base pairs 

(B) TYPE: nucleic acid 
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: C ; STRANDEDNESS : hot h 
Z TOPOLOGY; both 

■1: MOLECULE TYPE: cDNA 

■::r HYPOTHETICAL : NO 

; iv: ANT I -SENSE : NO 



'xi': SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
TCACCGTCCT TAGATCACCA TGGATGCAAT GAAGAGAGGG CTCTGCTGTG TGCTGCTGCT 6 0 

GTGTGGAGCA GTCTTCGTTT CGCCCAGCGA GATOTGCTGT GCCTTCTAGT TGCCAGCC 118 
■2; INFORMATION FOR SEQ ID NO : 8 C : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
; l l i ) HYPOTHETICAL : NO 
{iv) ANT I -SENSE: NO 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 80: 
TTCGTTTCGC CCAGCGATCA CAGAAAAATT GTGGGTCACA GTC 
[2] INFORMATION FOR SEQ ID NO: 81: 

(i: SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
i iv) ANTI -SENSE : NO 



[Xi) SEQUENCE DESCRIPTION: SEQ ID NO; 81: 
CGCACAGCAG ATCCACGTGT TAGCGCTTTT CTCTCTCCAC CAC 



43 
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■ i : SEQUENCE CHARACTERISTICS: 
A LENGTH: 80 base pairs 
: = TYPE : nucleic acid 
C: STRANDEDNESS: both 
'Di TOPOLOGY, both 

!. ID MOLECULE TYPE ■ cDNA 

(111) HYPOTHETICAL : NO 

iiv ANTI -SENSE: NO 



(xi : SEQUENCE DESCRIPTION: SEQ ID NO: 82: 
TCACCGTCCT TAGATCTACC ATGGGACCAG TACAACAAAT AGGTGGTAAC TATGTCCACC 
TGCCATTAAG CCCGAGAACA 
(2 j INFORMATION FOR SEQ ID NO: 83: 

(l) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

ill) MOLECULE TYPE : CDNA 
(in) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
GGCACAGCAG ATCTTTACAT TAATCTAGCC TTCTGTCCCG GTCC 
(2) INFORMATION FOR SEQ ID NO: 64: 

!l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

111) MOLECULE TYPE: cDNA 
fui) HYPOTHETICAL : NO 
iiv) ANT I -SENSE: NO 
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'.xi- SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
TCACCGTCCT TAGATCGGTA CAACCATGGG TGGAGCTATT TCCATGAGGC AATCCAAGCC 



GGCTGGAGAT CTGACAGAAA 3C 

•2) INFORMATION FOR SEQ ID NO : 8 5 : 

;i; SEQUENCE CHARACTERISTICS: 
(A! LENGTH: 35 base pairs 
(3 1 TYPE: nucleic acid 
(C: STRANDEDNESS : both 
(D) TOPOLOGY: both 

in) MOLECULE TYPE : cDNA 

(in) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 



ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

GGCACAGCAG ATCACCTAGG TTAGCCTTCT TCTAACCTCT TCCTCTGACA GGCCTGACTT 6 0 

GCTTCCAACT CTTCTGGGTA TCTAG 35 
(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE : cDNA 
iiii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE : NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
ACCGTCCTTA GATTCGACAT AGCAGAATAG GCCTTACTCG ACAGAGGAGA GCAAGAAATG 60 
GAGCCAGTAG ATCCTAGACT AGAGCCCTGG 9 0 

(2'' INFORMATION FOR SEQ ID NO: 8^: 
ii SEQUENCE CHARACTERISTICS: 
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:A> LENGTH: 41 base pairs 

3 TYPE : nurleic acid 

{Z' STRAaJDEDNESS : both 

;D TOPOLOGY: both 

tii) MOLECULE TYPE : cDNA 

!lli) HYPOTHETICAL : NO 

(iv) ANTI -SENSE : NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
GGCACAGCAG ATCCGAGATG CTGCTCCCAC CCCATCTGCT G 
(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

Slii) HYPOTHETICAL : NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE : internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

Lys Gin lie He Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr Al 
15 10 15 



(2} INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(lii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE : NO 

(v) FRAGMENT TYPE : internal 
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xi: SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

His Glu Asp He He Ser Leu Trp Asp Gin Ser Leu Lys 
1 5 10 

INFORMATION FOR SEQ ID NO : 9 0 : 

m SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 
{ B ) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

!ii! MOLECULE TYPE: peptide 

till) HYPOTHETICAL: NO 

(iv) ANT I- SENSE: NO 

iv) FRAGMENT TYPE: internal 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

Asp Arg Val He Glu Val Val Gin Gly Xaa Tyr Arg Ala He Arg 
15 10 15 

[2) INFORMATION FOR SEQ ID NO: 91: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
(ill) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 91: 
GGTACAAATA TTGGCTATTG GCCATTGCAT ACG 
(2 1 INFORMATION FOR SEQ ID NO: 92: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 
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ST HANDEDNESS : both 
TOFC LOGY: both 



i l 



MOLECULE TYPE: cDNA 



HYPOTHETICAL: NC 



ANT I- SENSE: NO 



SEQUENCE DESCRIPTION : SEQ ID NO: 92: 



CACATCTCG AGGAACCGGG TCAATTCTTC AGCACC 



2; INFORMATION FOR SE? ID NC:93: 

ii} SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE : nurleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(li) MOLECULE TYPE: cDNA 
(in) HYPOTHETICAL: NO 
(ivj ANT I- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
TACAGATA TCGGAAAGCC ACGTTGTGTC TCAAAATC 3B 
2) INFORMATION FOR SEQ ID NO : 9 4 : 

(i! SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY, both 

(ii) MOLECULE TYPE cDNA 
(in) HYPOTHETICAL: NO 
!iv) ANTI-SENSE : NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 4 : 
CACATGGAT CCGTAATGCT CTGCCAGTGT TACAACC 3 7 

2' INFORMATION FOR SEQ ID NO: 95: 
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: SEQUENCE CHARACTERISTICS: 
■.A: LENGTH: 3 9 base pairs 
\ E ■ TYPE: nucleic acid 
;C! STRANDEDNESS: both 
ID) TOPOLOGY: both 

■ii; MOLECULE TYPE: cDNA 

nil) HYPOTHETICAL : NO 

i:v: ANTI - SENSE : NO 



:xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 
GGTACATGAT CACGTAGAAA AGATCAAAGG A7CTTCTTG 
(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 
(D> TOPOLOGY: both 

in) MOLECULE TYPE : cDNA 

ni.il HYPOTHETICAL : NO 

(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 
CCACATGTCG ACCCGTAAAA AGGCCGCGTT GCTGG 
(2) INFORMATION FOR SEQ ID NO:97 : 

( l ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO : 9 
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■j A j — AA - AT AAATGT AC 



INFORMATION FCR SEQ ID NO : 9 8 : 

' l ; SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 15 base pairs 
(Bi TYPE: nucleic acid 
[C, STRANDEDNESS: both 
( D ) TOPOLOGY: both 

ill) MOLECULE TYPE: cDNA 

liii; HYPOTHETICAL: NO 

!iv: ant: -sense : no 



(x:) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
CAATAGCAGG CATGC 

(2) INFORMATION FOR SEQ ID NO : 9 9 : 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

( D ) TOPOLOGY: both 

(n) MOLECULE TYPE: cDNA 
iiii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
GCAAGCAGCA GATTAC 

[2) INFORMATION FOR SEQ ID NO: 100: 

til SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 354? base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : both 

!il) MOLECULE TYPE: cDNA 
iiii) HYPOTHETICAL : NO 
(iv) ANTI -SENSE: NO 
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(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
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GTAAATGGCC 


CGCCTGGCTG 
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TATGTTCCCA 


TAGTAACGCC 


AATAGGGAfT 
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CGGTAAACTG 


crcACTTGnr 
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IVjLLAAu lA^. 
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LAI IAITjUU^ 


AG I AC ATGA _ 


CTTATGGGAC 
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U 1 At-<j irtl 1A 


G 1 >-AiUjL j. A 


i TACCATGGT 


GATGCGGTTT 
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1 jvj^ALj 1 ALA 


1 LAA ITjGOCLj 


rGGA TAGCG^j 


TTTGACTCAC 


GGGGATTTC I' 


AAGTCTCCAC 
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1 LAA 1 WA>AIj 


TTTGTTTTGG 


CACCAAAATC 


AACGGGACTT 


TCCAAAATGT 
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<j A G AAA 1G 


GGCGCs 1 ACiOC 


GTG T AC GGT j 


GGAGGTCTAT 
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>jAAk_i_kj 1 L. Avj 




GACGCCATCC 


ACGCTGTTTT 
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AGGTGATGGT 


ATAGCTTAGC 


CTATAGGTGT 
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CCACTCCCCT 


ATTGGTGACG 


ATACTTTCCA 


TTACTAATCC 


AT AAC ATGG C 


TCTTTGCCAC 


1020 


AACTCTCTTT 


ATTGGCTATA 


TGCCAATACA 


CTGTCCTTCA 


GAGACTGACA 


CGGACTCTGT 


1080 


ATTTTTACAG 


GATGGGGTCT 


CATTTATTAT 


TTACAAATTC 


ACATATACAA 


CACCACCGTC 


1140 


CCCAGTGCCC 


GCAGTTTTTA 


TTAAACATAA 


CGTGGGATCT 


CCACGCGAAT 


CTCGGGTACG 


1200 


TGTTCCGGAC 


ATGGGCTCTT 


CTCCGGTAGC 


GGCGGAGCTT 


CTACATCCGA 


GCCCTGCTCC 


1260 


CATGCCTCCA 


GCGACTCATG 


GTCGCTCGGC 


AGCTCCTTGC 


TCCTAACAGT 


GGAGGCCAGA 


1320 


CTTAGGCACA 


GCACGATGCC 


CACCACCACC 


AGTGTGCCGC 


ACAAGGCCGT 


GGCGGTAGGG 


1380 


TATGTGTCTG 


AAAATGAGCT 


CGGGGAGCGG 


GCTTGCACCG 


CTGACGCATT 


TGGAAGACTT 


1440 


AAGGCAGCGG 


CAGAAGAAGA 


TGCAGGCAGC 


TGAGTTGTTG 


TGTTCTGATA 


AGAGTCAGAG 


1500 


GTAACTCCCG 


TTGCGGTGCT 


GTTAACGGTG 


GAGGGCAGTG 


TAGTCTGAGC 


AGTACTCGTT 


1560 
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GCTGTCGCGC GC3CCACCA3 ACATAATAGC TGACAGACTA AC AGACTGTT CCTTTCCATG 152: 

GGTCTTTTCT GCAGTCACCG TCCTTAGATC TGCTGTGCCT TCTAGTTGCC AGCCATCTGT 168C 

TGTTTGCCCC TCCCCCGTGC CTTCCTTGAC CCTGGAAGGT GCCACTGCCA CTCTCCTTTC 17 4" 

C7AATAAAAT GAGGAAATTG CATCGCATTG TCTGAGTAG3 TGTCATTCTA TTCTGGGGGG 18QQ 

TGGGGTGGGG CAGCACAGCA AGGGGGAGGA TTGGGAAGA3 AATAGCAGGC ATCCTGGGGA 186 0 

TGCGGTGGGC TCTATGGGTA CGGCCGCAGC GGCCGTACCC AGGTGCTGAA GAATTGACCC 192 C' 

GGTTCCTCGA CCCGTAAAAA GGCCGCGTTG CTGGCGTTTT TCCATAGGCT CCGCCCCCCT 198: 

GACGAGCATC ACAAAAATCG ACGCTCAAGT CAGAGGTGGC GAAACCCGAC AGGACTATAA 2 04 0 

AGATACCAGG CGTTTCCCCC TGGAAGCTCC CTCGTGCGCT CTCCTGTTCC GACCCTGCCG 2100 

CTTACCGGAT ACCTGTCCGC CTTTCTCCCT TCGGGAAGCG TGGCGCTTTC TCAATGCTCA 2160 

CGCTGTAGG? ATCTCAGTTC GGTGTAGGTC GTTCGCTCCA AGCTGGGCTG TGTGCACGAA 2 22C 

CCCCCCGTTC AGCCCGACCG CTGCGCCTTA TCCGGTAACT ATCGTCTTGA GTCCAACCCG 2280 

GTAAGACACG ACTTATCGCC ACTGGCAGCA GCCACTGGTA ACAGGATTAG CAGAGCGAGG 2 340 

TATGTAGGCG GTGCTACAGA GTTCTTGAAG TGGTGGCCTA ACTACGGCTA CACTAGAAGG 2 40 0 

ACAGTATTTG GTATCTGCGC TCTGCTGAAG CCAGTTACCT TCGGAAAAAG AGTTGGTAGC 24 6 0 

TCTTGATCCG GCAAACAAAC CACCGCTGGT AGCGGTGGTT TTTTTGTTTG CAAGCAGCAG 2 52 0 

ATTACGCGCA GAAAAAAAGG ATCTCAAGAA GATCCTTTGA TCTTTTCTAC GTGATCCCGT 258 0 

AATGCTCTGC CAGTGTTACA ACCAATTAAC CAATTCTGAT TAGAAAAACT CATCGAGCAT 2 64 0 

CAAATGAAAC TGCAATTTAT TCATATCAGG ATTATCAATA CCATATTTTT GAAAAAGC CG 27 0 0 

TTTCTGTAAT GAAGGAGAAA ACTCACCGAG GCAGTTCCAT AGGATGGCAA GATCCTGGTA 27 60 

TCGGTCTGCG ATTCCGACTC GTCCAACATC AATACAACCT ATTAATTTCC CCTCGTCAAA 23 20 

AATAAGGTTA TCAAGTGAGA AATCACCATG AGTGACGACT GAATCCGGTG AGAATGGCAA 28 80 

AAGCTTATGC ATTTCTTTCC AGACTTGTTC AACAGGCCAG CCATTACGCT CGTCATCAAA 29 4 0 

ATCACTCGCA TCAACCAAAC CGTTATTCAT TCGTGATTGC GCCTGAGCGA GACGAAATAC 3 000 

GCGATCGCTG TTAAAAGGAC AATTACAAAC AGGAATCGAA TGCAACCGGC GCAGGAACAC 3 0 60 

TGCCAGCGCA TCAACAATAT TTTCACCTGA ATCAGGATAT TCTTCTAATA CCTGGAATGC 312C 

TGTTTTCCCG GGGATCGCAG TGGTGAGTAA CCATGCATCA TCAGGAGTAC GGATAAAATG 3 18 0 

CTTGATGGTC GGAAGAGGCA TAAATTCCGT CAGCCAGTTT AGTCTGACCA TCTCATCTGT 3 24 0 
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AACA7CA77G 3CAACGCTAC ITTTGCCATG TTTC AGAAAT AACTCTGGCG CATCGGGCTT 12C: 

CCCATACAAT C3ATAGATTG TCGCACCTGA TTGCCCGACA TTATCGCGAG CCCATTTATA 3 2€0 

CCC ATATAAA TCAGCATCCA TGTTGGAATT TAATCGCGGC CTCGAGCAAG ACG7TTCCCG 3 42C 

TTGAATATGG CTCATAACAC CCCTTGTATT ACTGTTTATG TAAGCAGACA GTTTTATTGT 3 4 3 0 

TCATGATGAT ATATTTTTAT CTTGTGCAAT GTAACATCAG AGATTTTGAG ACACAACGTG 3 54C 

GCTT7CC 3547 
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WHAT IS CLAIMED IS: 

I . A polynucleotide which, upon introduction into a 
_ mammalian cell induces the co-expression in the cell of at least two gene 
products, comprising: 

a first transcriptional promoter which operates in 
eukaryotic cells upstream from, and in transcriptional control of. a first 
cistron; 

a second cistron downstream from the first cistron, 

10 

under transcriptional control either of the first transcriptional promoter or 
under control of a second transcriptional promoter; 

optionally, a third cistron downstream from the second 
cistron. under transcriptional control either of the first transcriptional 
^ promoter or under control of the second transcriptional promoter, or under 
control of a third transcriptional promoter; and 

a transcriptional terminator following each of the first, 
second and third cistron. 

2o 2. The polynucleotide of Claim 1 wherein the first cistron 

encodes at least one immunogenic epitope of a pathogen or a cancer 
associated antigen. 

3. The polynucleotide of Claim 2 wherein the pathogen is a 

virus. 
25 

4. The polynucleotide of Claim 3 wherein the virus is the 
human immunodeficiency virus (HIV). 

5. The polynucleotide of Claim 2 wherein the first cistron 

30 

encodes a human immunodeficiency virus (HIV) gene selected from env, 
gag. gag/pol, gag/protease, gag and portions of pol not encoding a 
functional polymerase, and pol. 
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6. The polynucleotide of Claim ] wherein the second 
cistron encodes a human immunodeficiency virus (HIV) REV cene if the 
first cistron encodes an HIV gene, the efficient expression of which is 
dependent on availability within the cell expressing the HIV eene of the 
REV gene product. 

7. The polynucleotide of Claim 6 wherein the first cistron 
encodes an HIV late gene selected from env, gag and pol. 

s - The polynucleotide of Claim 7 wherein the first cistron 
encodes HIV gp!60, HIV gpl20, HIV g P 41, HIV gpl20 lacking a CD4 
binding site and HIV env with an immunologically altered V3, the altered 
V3 having an altered glycosylation pattern or substituted V3 loop tips. 

9 - The polynucleotide of Claim 6 wherein the third cistron 
encodes a cytokine or a T-cell costimulatory element. 

1 0. The polynucleotide of Claim 9 wherein the cytokine is 
interferon, GM-CSF. or interleukin. 

20 

1 I . The polynucleotide of Claim 9 wherein the T-cell 
costimulatory element is a gene encoding a B7 protein. 

25 12 - Th e polynucleotide of Claim I wherein the first cistron 

encodes a REV-independent human immunodeficiency (HIV) epitope, the 
second cistron encodes a cytokine, and the third cistron encodes a T-cell 
costimulatory element, wherein each of the cistrons may also be presented 
in a different order. 

13. The polynucleotide of Claim 12 wherein the second 
cistron encodes an interleukin, an interferon, or GM-CSF. and the third 
cistron encodes a B7 protein. 
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14. The polynucleotide of Claim 1 wherein either of the 
second and third cistron is under transcriptional control of the 
transcriptional promoter upstream of the first cistron. a sequence is 
provided upstream of each of the second and third cistrons having the 
function of an internal ribosome entry site (IRES) to effect efficient 
translation of the second and third cistrons on a bi- or tri-cistronic 
messenger RNA transcribed from the beginning of the first cistron through 
each of the second and third cistrons up to the transcriptional terminator 
following the second or third cistron. 

10 

15. The polynucleotide of Claim 14 wherein the IRES is 
selected from encephalomyocarditis virus fEMCV) IRES, swine vesicular 
virus IRES and poliovirus rRES. 

16. The polynucleotide of Claim 14 wherein the first cistron 
encodes a human immunodeficiency virus (HIV) REV dependent gene, the 
second cistron encodes REV, and the third cistron encodes a T-cell 
costimulatory element or a cytokine, and further, wherein the first cistron 
is preceded by a transcriptional promoter and the second and third cistrons 
are each preceded by an IRES and no transcriptional promoter. 

17. The polynucleotide of Claim 16 wherein the first cistron 
encodes an HIV gpl60, the first cistron is preceded by cytomegalovirus 
immediate early promoter, the second cistron encodes HIV REV, the 
optional third cistron encodes an interferon, GM-CSF, an interleukin, or a 
B7 protein. 

1 8. A polynucleotide which comprises contiguous nucleic 
acid sequences which cannot replicate in eukaryotic cells but which are 
capable of being expressed to produce a gene product upon introduction of 
the polynucleotide into eukaryotic tissues in vivo , wherein the gene product 
either acts as an immunostimulant or as an antigen capable of generating an 
immune response, wherein the nucleic acid sequences encode: 

a spliced REV gene; 
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a spliced human immunodef iciency virus (HIV) 
immunogenic epitope; and 

optionally, a cytokine or a T-cell recognition element. 

3 19. The polynucleotide of Claim IS wherein the HIV 

immunogenic epitope selected from gag, gag-protease, or env or an 
immunogenic subportion thereof; the cytokine is interleukin-12, and the T- 
cell costimulatory element is a B7 protein. 

10 20. The polynucleotide of Claim 19 wherein the env 

immunogenic epitope is selected from HIV gpl60. HIV gpl20 and HIV 
gp41. 

21 . The polynucleotide of Claim 19 wherein the gag 
immunogenic epitope is pi 7, p24, or pi 5. 

22. A polynucleotide comprising a first gene encoding an 
HIV gag, gag-protease, or env immunogenic epitope, the gene containing a 
REV responsive element (RRE) or having been modified to contain an 

2C tlRE, the gene being operatively linked with a transcriptional promoter 
suitable for gene expression in a mammal, the gene being linked with an 
internal ribosome entry site (IRES), and the IRES being linked with a gene 
encoding a REV gene product. 

25 

23. The polynucleotide construct: 

a) Vl.Ins-rgv lHB . which has the junction sequence SEQ.ID:56: 

5 -GGA GAC AGC GACGAA GAC CTC CTC AAG GCA GTC AGA CTC ATC AAG-3 , and 
3 %EQ.ID:57: 

5 -GAT GGC TGG CAA CTA GAA GGC ACA GCA GAT CT/ GAT ATC GCA CTA 
BGH rev,.. 

TTC TTT AGC TCC TG A CTC CAA TAT TGT-3' 
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b.i Vl.lns-gplfiOlHB . which has the junction sequence SEQ.ID;58: 

5'-CTT AG A TC' A ACC ATG AG A GTG AAG GA GAA ATA TCA GCA CTT GTG 
5 CMVinta gpl60 

GAG ATG GGG GTG GAG ATG GGG CAC CAT GCT CCT TGG GAT GTT GAT GAT CTG 
TAG TGC TAG AGA AAA ATT GTG GGT-3'. 

10 and SEQ.ID:5S>: 

5 -CTG GCA ACT AGA AGG CAC AGC AGA TC/ A GAT ACT GTC CCC ATC TTA 
BGH gpl60 

TAG CAA A AT CCT TTC CAA GCC CTG TCT TAT TCT-3 
15 

c) nGEM-3-IRES . which has the junction sequence SEQ.ID:62: 

5 -CAT GCC TGC AGG TCG ACT CTA/ AAT TCC G... 
pGEM-3 (SP6) IRES 

20 

and SEQ-ID:63: 

5 -A CCC GGG GAT CCT CT7 A GCG CGC TTG TCT CTT GTT CCA... 
pGEM-3 (T7) IRES 

25 d) nGEM-3-IRES/rgvlTTR . which has the junction sequence SEQ.ID:65: 

5 -TAT GGC CAC AAC CI AT GGC AGG AAG AAG CGG AGA CAG CGA CGA AGA 

IRES rev 

CCT CCT CAA GGC AGT CAG ACT -3' 
30 

and SEQ-ID:66: 

5 -CTC GAG CCA TGG GCC CCT/ AGA CTA TAG CGT GAT AAG AAA TCG AGG 
pGEM-3 rev 
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ACT GAG GTT ATA AC A TCC TCT A AG GTG GIT ATA A AC TCC CGA AGG- 3' 

ej pGEM-3-RRE/IRES/revniH , which has the junction sequence SEQ.ID:68: 
? -TTG CAT GCC TGC AGG T/ GGT ACA TGA TCA GAT ATC G CCC GGG / C 
5 pGEM-3 RRE 

CGA GAT CTT CAG ACT TOG AGG AGG AGA TAT GAG GGA CAA TTG GAG- 3 
IRES-5' 

1Q and SEQ.ID:69: 

5 -GGG GCG GAA TT/ T AGA GTC A/ ATT GAT CAG CTT GTG TAA TTG TTA 

RRE-3' 

ATT TCT CTG TCC CAC TCC ATC CAG GTC GTG TGA TTC...-3' 
15 

0 Vl.\ns-(tat/rev SD) . which has the junction sequence SEQ.ID:71: 

5 - AG A TCT A AGG ACG GTG ACT GCA / TGT ACT ACT TAC TGC TTT GAT 
CMVintA tatfrev SD 

20 

AGA GGA CGG TGA / CTG CAG AAA AGA CCC ATG GAA A-3 

CMVintA 

g) VUns-gnl60 iHB/IRES/rgvniB (SD) . which has the junction sequence 
2 %EQ.ID:73: 

5-GGC ACA GCA GAT C/ AG ATG GGG ATC TGA TA TCG CAC TAT TCT TTA 

BGH rev 
GCT CCT GAC TCC TGA CTC-3 

30 andSEQ.ID:74: 

5 -GGA ATT/ TGA GTC ATC / CCC ATC TTA TAG CAA A AT CCT TTC CAA -3' 
IRES gpl60 

h) Vl.Ins-ggg-fjrrniB (SD) . which has the junction sequence SEQ.ID:75: 
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y-CTT AG A TO C CCC CAC GGC A AG AGG CGA GGG GCG GCG ACT GGT- V 
CMVintA gag fSD) 

and SEQ.ID:76: 

5 5'-GGC ACA GCA GAT C/ CGC CCG GGC TTA CAT CTC TGT ACA A AT TTC TAC 
BGH prt 

TAA TGC TTT TAT TTT TCT TCT GTC...-3' 

Vl.lns-gag-pniXJB, which has the junction sequence SEQ.ID:77; 

5'-CTT AGA TO CAC CAT GGG TGC GAG AGC GTC AGT ATT AA GCG GGG 
CMVintA gag 

L ^GA GAA TTA GAT CGA TGG GAA AAA ATT.. .-3' 
and SEQ.ID:7K: 

5-GGC ACA GCA GAT O CGC CCG GGC TTA CAT CTC TGT ACA AAT TTC TAC 
BGH p rt 

20 

TAA TGC TTT TAT TTT TCT TCT GTC .-3 

j) VUns-tPA which has the junction sequence SEQ.ID:79: 
5'-TCA CCG TCC TTA GAT C/ ACC ATG GAT GCA ATG AAG AGA GGG CTC TGC 
25 CMVintA tPA leader 

TGT GTG CTG CTG CTG TGT GGA GCA GTC TTC GTT TCG CCC AGC GA/ G ATC 

BGH 

30 TGC TGT GCC TTC TAG TTG CCA GCC-3 

k) Vl,lns-tPA-gpl20 MN , which has the junction sequence SEQ.rD:80: 

5 -TTC GTT TCG CCC AGC G A/ TCA CAG AAA AAT TGT GGG TCA CAG TO V 
tPA SP120MN 
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and SEQ.ID:81: 

5 GGC AC A GCA GAT C/ CAC GTG TTA GCG CTT TTC TCT CTC CAC CAC-3 
BGH gpl20MN 

1 > V1I-SIVMAC251 P^S gag , which has the junction sequence SEQTD:8 

5 -TCA CCG TCC TTA GAT CT/ ACC ATG GGA CCA GTA CAA CAA ATA GGT 

CMVintA p28 gag... 

GGT AAC TAT GTC CAC CTG CCA TTA AGC CCG AG A ACA-3' 
10 

and SEQTD:K3: 

5 GGC ACA GCA GAT CT/ TTA CAT TAA TCT AGC CTT CTG TCC CGG TCC-3 
BGH p28 gag 

15 m) VlJ-SIVMAC251fl ££, which has the junction sequence SEQ.ID:84: 

5-TCA CCG TCC TTA GAT C/ GGT ACA ACC ATG GGT GGA GCT ATT TCC ATG 

CMVintA nef 

AGG CAA TCC A AG CCG GCT GGA GAT CTG ACA GAA A-3' 
20 

andSEQTD:X5: 

5-GGC ACA GCA GAT CA/ C CTA GGT TAG CCT TCT TCT AAC CTC TTC CTC 

BGH nef.... 

TGA CAG GCC TGA CTT GCT TCC AAC TCT TCT GGG TAT CTA G-3 
25 

n) Vl.]ns-tat/rev/env : , which has the junction sequence SEQ.ID:86: 
5 -ACC GTC CTT AGA T/ TC G AC ATA GCA GAA TAG GCG TTA CTC G AC AGA 
CMVintA tat/revienv 

30 

GGA GAG CAA GAA ATG GAG CCA GTA GAT CCT AGA CTA GAG CCC TGG-.V 



.indSEQ.ID:S7: 

5 -GGC ACA GCA GAT C/ C GAG ATG CTG CTC CCA CCC CAT CTG CTG-3 
BGH tatlrevlenv 
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24. A polynucleotide which induces anti-HIV neutralizing 
antibody, HIV specific T-celJ immune responses, or protective immune 

^ responses upon introduction into vertebrate tissue, including human tissue 
in vivo , wherein the polynucleotide comprises a gene encoding a gene 
product selected from HIV gag, HIV gag-protease, and HIV env, the gene 
containing a REV responsive element (RRE), the gene being operatively 
linked with a transcriptional promoter suitable for gene expression in a 

^ mammal, the gene being linked with an internal ribosome entry site (IRES), 
and the IRES being linked with a second gene, the second gene encoding a 
REV gene product. 

25. A method for co-expression in a single cell in vivo , of at 
teast two gene products, which comprises introducing between about 1 ng 
and about 100 mg of the polynucleotide of Claim I into the tissue of the 
vertebrate. 

26. A method for inducing immune responses in a vertebrate 
against HIV epitopes which comprises introducing between about 1 ng and 
about 100 mg of the polynucleotide of Claim 6 into the tissue of the 
vertebrate. 



20 



27. A method for inducing immune responses in a vertebrate 
against HIV epitopes which comprises introducing between about 1 ng and 
about 100 mg of the polynucleotide of Claim 14 into the tissue of the 
vertebrate. 

28. A method for using a REV dependent HIV gene to 
^induce immune responses in vivo which comprises: 

a) isolating the REV dependent HIV gene; 

b) linking the isolated gene to regulatory sequences such 
that the gene is expressible by virtue of being operatively linked to control 
sequences which, when introduced into a living tissue, direct the 
transcription initiation and subsequent translation of the gene; 
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c) introducing the expressible gene into a living tissue; 

and 

d) introducing a gene encoding HIV REV either in trans 
_ or in cis to the HIV REV dependent gene. 

29. The method of Claim 28 which further comprises 
boosting with additional expressible HIV gene, or boosting with a 
recombinant purified HIV gene product. 

10 

30. The method of Claim 28 wherein the REV-dependent 
HIV gene encodes a gag, or an env gene product. 

31. A method for inducing immune responses against 

i mfection or disease caused by virulent strains of HIV which comprises 
introducing into the tissue of a vertebrate an HIV gene from a first HIV 
strain such that an induced immune response neutralizes infection by the 
first HIV strain but also neutralizes infection by strains heterologous to the 
first strain, wherein the HTV gene encodes a conserved, REV dependent 
HIV epitope and a functional REV is provided either in cis or in trans. 

32. A vaccine for inducing immune responses against HIV 
infection which comprises the polynucleotide of Claim 1 and a 
pharmaceutically acceptable carrier. 

25 

33. A method for inducing anti-HIV immune responses in a 
pnmate which comprises introducing the polynucleotide of Claim 1 into the 
tissue of the primate and concurrently administering interleukin 12 
parenterals. 

30 

34. The method of Claim 33 wherein the first cistron of the 
polynucleotide encodes HIV gpl60, the second cistron of the polynucleotide 
encodes HIV REV. and the third cistron of the polynucleotide encodes B7. 



35. A polynucleotide comprising: 
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a) an eukaryotic transcriptional promoter; 

b) an open reading frame 3' to the transcriptional 
promoter encoding an immunogenic HIV epitope wherein the open reading 

^ frame has a splice donor sequence at the 5'-side of the open reading frame, 
a REV responsive element anywhere within the open reading frame, and a 
stop codon encoding the termination of translation of the open reading 
frame; 

cj an internal ribosome entry site (IRES) 3' to the 
translation stop codon of the open reading frame; 

d) an open reading frame encoding a spliced HIV REV 
gene at the 3' end of which is a translation stop codon; 

e) optionally, 3' to the REV translation stop codon, a 
second IRES, followed by an open reading frame encoding 
immunomodulatory or immunostimulatory genes, the genes being selected 
from GM-CSF, IL-12, interferon, and a B7 protein; 

f) a transcription-termination signal following the last open 
reading frames. 

2Q 36. A method of inducing an antigen-presenting cell to 

stimulate cytotoxic and helper T-cell proliferation effector functions, the 
functions comprising lymphokme secretion specific to HIV antigens, the 
method comprising: 

a) exposing cells of a vertebrate in vivo to a 

polynucleotide, the polynucleotide comprising sequences encoding an 
antigenic HIV epitope, optionally, HIV REV, and sequences encoding a B7 
protein. 

37. The method of Claim 36 wherein the HIV epitope is 
^selected from env, gag, and pol. 

38. The method of Claim 36 wherein the polynucleotide 
encodes an IRES between each of the HIV epitope, the REV, and the B7 
protein. 
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39. A polynucleotide which comprises sequences encoding: 

a) an eukaryotic transcription initiation signal; 

b) an HIV gene open reading frame (ORF) preceded by 
an heterologous leader sequence such that expression of the HIV gene ORF 
does not depend on availability of the HIV REV gene product; 

c) a sequence which operates as an internal nbosome 
entry site (IRES) 3' to the translation stop codon of the HIV ORF; 

d) a sequence encoding an ORF of a T-cell costimulatory 

element 3' to the IRES; and 
io 

e) a transcription termination signal 3* to the translation 
stop codon of the T-cell costimulatory element. 

40. The polynucleotide of Claim 39 wherein the HIV gene 
ORF in (b) is tPAgplZO or tPAgpl60. 

41. A polynucleotide which comprises sequences encoding: 

a) an eukaryotic transcription initiation signal; 

b) an HIV gene open reading frame (ORF) preceded by 
an heterologous leader sequence such that expression of the HIV gene ORF 
does not depend on availability of the HIV REV gene product; 

c) a sequence which operates as an internal ribosome entry site (IRES) 3' to 
the translation stop codon of the HIV ORF; 

d) an HIV gene open reading frame (ORF) preceded by an heterologous 

^ Reader sequence such that expression of the HIV gene ORF does not depend 
on availability of the HIV REV gene product; and 

e) a transcription termination signal 3' to the translation stop codon of the 
HIV gene ORF. 

3Q 42. A composition comprising multiple expression constructs 

each of which is capable of inducing expression in mammalian tissue of 
more than a single cistron encoding antigens related to disease causing 
pathogens or tumors. 



WO 95/24-185 



PCT/XS95/02633 



- 14c ■ 



43. A method for immunization of a host vertebrate 
comprising the step of: 

introducing into direct contact with tissue of the host a non-infectious, non- 
integrating polynucleotide encoding at least a first and a second peptide or 
polypeptide, each of which is immunogenic or immunomodulatory when 
produced as a translation products in the host wherein the first peptide or 
polypeptide is encoded by a segment of the polynucleotide which is under 
operative control of a first transcriptional promoter and the second peptide 
or polypeptide is encoded by a segment of the polynucleotide under 
operative control of the first transcriptional promoter, in which case no 
transcriptional terminator is provided between the polynucleotide segment 
encoding the first peptide or polypeptide and the segment of polynucleotide 
encoding the second peptide or polypeptide, or the second peptide or 

polypeptide is encoded by a segment of the polynucleotide under operative 
control of a second transcriptional promoter, in which case a transcriptional 
terminator is provided between the segment of polynucleotide encoding the 
first peptide or polypeptide and the segment of polynucleotide encoding the 
second peptide or polypeptide, whereby both of the first and second peptide 

Q or polypeptide are produced within a single cell of the host, resulting in the 
immunization. 

44. A polynucleotide construct having the elements shown in 
figure 2, wherein each of the first, second and third cistrons shown in the 
figure encode a combination of any two to three of the following: 

1) tPA-g P 120MN: 

2) gpl60lHB/IRES//?£l/niB; 

3) gpl60lHB; 

4) REVniB- 

5) tat/REV/gpl60; 
: 6) REV/gpI60; 

1) gpl60MN; 

8) gpI60 from clinically relevant primary HIV isolates; 

9) nef, using the gene from clinically relevant strains; 

10) .wrinB; 
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11) tPA-gpl20niB: 

12) gpl60 with structural mutations including V3 loop 
substitutions from clinically relevant strains of HIV; 

5 several mutations on several constructs such as variable 

loop removal. Asn mutations to remove stenc carbohydrate 
obstacles to structural, neutralizing antibody epitopes; 
and CD4 binding site knockout mutants; 

13) gp41 with provision of appropriate leader sequences, 
1Q as in the tPA signal peptide leader sequence; 

14) xax: similar to construct from #5 above, using the 
gene from clinically relevant strains; 

15) rev: for gpl60 and gag dicistronics; 

16) B7 coding sequences; 

17) GM-CSF sequences; 

18) Interleukin sequences; 

19) Tumor associated antigens; 

20) Genes encoding antigens expressed by pathogens other 
than HIV, such as, but not limited to, influenza virus 

20 nucleoprotein, hemagglutinin, matrix, neuraminidase, and 

other antigenic proteins; herpes simplex virus genes; human 
papillomavirus genes; tuberculosis antigens; hepatitis A, B, 
or C virus antigens; and combinations of these and other 
antigens to form at least dicistronic constructs which may 

25 be combined with multiple other polycistronic constructs to 

provide a cocktail composition capable of raising immune 
responses against all of the represented pathogens or 
tumor antigens; 

wherein the segments A and B of figure 2 are internal ribosome entry sites 
3 or a combination of transcription termination sequences terminating the 
transcription of the upstream cistron and transcriptional promoter 
sequences, initiating the transcription of downstream cistron. 
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1 TCGCGCGTTI CGGTGATGAC GGTGAAAACC TCTGACACAT GCAGCTCCCG 

51 GACACCGTCA CAGCTTGTCT GTAAGCGGAT GCCGGGAGCA GACAAGCCCC 

101 TCAGGGCGCG TCAGCGGGTG TTGGCGGGTG TCGCCCCTGC CTTAACTATG 

151 CGGCATCAGA GCAGATTGTA CTGAGAGTGC ACCATATGCG GTGTGAAATA 

201 CCGCACAGAT GCGTAAGGAG AAAATACCGC ATCAGATTGG CTATTGGCCA 

251 TTCCATACGT TGTATCCATA TCATAATATG TACA7TTATA TTGGCTCATG 

301 TCCAACATTA CCGCCATGTT GACATTGATT ATTGACTAGI TATTAATACT 

351 AATCAATTAC GGGGTCATTA GTTCATAGCC CATATATGCA GTTCCGCGTT 

401 ACATAACTTA CGGTAAATGG CCCGCCTGGC TGACCGCCCA ACGACCCCCG 

451 CCCATTGACG TCAATAATGA CGTATGTTCC CATAGTAACG CCAATAGGGA 

501 CTTTCCATTG ACCTCAATGG GTGGAGTATT TACCGTAAAC TGCCCACTTG 

551 GCAGTACATC AAGTGTATCA TATGCCAAGT ACGCCCCCTA TTGACGTCAA 

601 TGACGGTAAA TGGCCCGCCT GGCATTATGC CCAGTACATC ACCTTATGGC 

651 ACTTTCCTAC TTGGCAGTAC ATCTACGTAT TAGTCATCGC TATTACCATG 

701 GTGATGCGGT TTTGGCAGTA CATCAATGGG CGTGGATAGC GGTTTGACTC 

751 ACGGGGATTT CCAAGTCTCC ACCCCATTGA CGTCAATGGG AGTTTGTTTT 

801 GGCACCAAAA TCAACGGGAC TTTCCAAAAT GTCGTAACAA CTCCGCCCCA 

851 TTGACGCAAA TGGCCGGTAG GCGTGTACGG TGGGAGGTCT ATATAAGCAG 

901 AGCTCGTTTA GTGAACCGTC AGATCGCCTG GAGACGCCAT CCACGCTGTT 

951 TTGACCTCCA TAGAAGACAC CGGGACCGAT CCAGCCTCCG CGGCCGGGAA 
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1001 CCCTCCAiTG CAACGCCCAT TCCCCGTGCC AAGAGTGACG IAAGTACCGC 
1051 CTATACACfC TATAGGCCCA CCCCCTTGGC TTCTT AIGCA TGCTATACTC 
1101 TTTTTGGCTT GGGGTCTATA CACCCCCCCT TCCTCATGTT ATAGGTGATG 
1151 GTATAGCTTA GCCTATAGGT GTGGGTTATT GACCAITATT GACCACTCCC 
1201 CTATTGGTGA CGATACTTTC CATTACTAAT CCATAACAIG GCTCTTTGCC 
1251 ACAACTCTCT TTATTGGCTA TATGCCAATA CACTGICCTT CAGAGACTGA 
1301 CACGGACTCT GTATTTTTAC AGGATGGGGT CTCATTTATI ATTTACAAAT 
1351 TCACATATAC AACACCACCG TCCCCAGTGC CCGCAGTTTT TATTAAACAT 
1401 AACGTCGGAT CTCCACGCGA ATCTCGGGTA CGTGTTCCGG ACATGGGCTC 
1451 TTCTCCGGTA GCGGCGGACC TTCTACATCC GAGCCCTGCT CCCATGCCTC 
1501 CAGCGACTCA TGGTCGCTCG GCAGCTCCTT GCTCCTAACA GTGGAGGCCA 
1551 GACTTAGGCA CAGCACGATG CCCACCACCA CCAGTGTGCC GCACAAGGCC 
1601 GTGGCGGTAG GGTATGTGTC TGAAAATGAG CTCGGGGAGC GGGCTTGCAC 
1651 CGCTGACGCA TTTGGAAGAC TTAACGCACC CCCAGAAGAA GATGCAGGCA 
1701 GCTGAGTTGT TGTGTTCTGA TAAGAGTCAG AGGTAACTCC CGTTGCGGTG 
1751 CTGTTAACGG TGGAGGGCAG TGTAGTCTGA GCAGTACTCG TTGCTGCCGC 
1801 GCGCGCCACC AGACATAATA GCTGACAGAC TAACAGACTG TTCCTTTCCA 
1851 TGGGTCTTTT CTGCAGTCAC CGTCCTTAG ATCTGCTGTG CCTTCTAGTT 
1901 GCCAGCCATC TGTTGTTTGC CCCTCCCCCG TGCCTTCCTT GACCCTGGAA 
1951 GGTGCCACTC CCACTGTCCT TTCCTAATAA AATGAGGAAA TTGCATCGCA 
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2001 TTGTCTCAGT AGGTCTCATT CTAIICTGGG GGGTGGCGTG GGGCAGCACA 
2051 GCAAGGGGGA GGATTGGGAA GACAATAGCA GGCATGCTGG GCAICCCCTG 
2101 GGCTCTATGG GTACCCAGGT GCTGAAGAAT TGACCCGGTT CCTCCTGGGC 
2151 CAGAAAGAAG CAGGCACATC CCCTTCICTG TGACACACCC TGTCCACGCC 
2201 CCTGGTTCTT AGTTCCACCC CCACTCATAG GACACTCATA GCTCAGGAGG 
2251 CCTCCCCCTT CMTCCCACC CGCTAAAGTA CTTGGAGCGG TCTCTCCCTC 
2301 CCTCATCAGC CCACCAAACC AAACCTAGCC TCCAAGAGTG GGAAGAAATT 
2351 AAAGCAAGAT AGGCTATTAA GTGCAGAGGG AGAGAAAATG CCTCCAACAT 
2401 GTGAGGAAGT AATGAGAGAA ATCATAGAAT TTCTTCCCCT TCCTCGCTCA 
2451 CTGACTCGCT GCGCTCGGTC GTTCGGCTGC GGCGAGCGGT ATCAGCTCAC 
2501 TCAAAGCCGG TAATACGGTT ATCCACAGAA TCAGGGGATA ACGCAGGAAA 
2551 GAACATGTGA GCAAAAGGCC AGCAAAAGGC CAGGAACCGT AAAAAGGCCG 
2601 CGTTGCTGGC GTTTTTCCAT AGGCTCCGCC CCCCTGACGA GCATCACAAA 
2651 AATCGACGCT CAAGTCAGAG GTGGCGAAAC CCGACAGGAC TATAAAGATA 
2701 CCAGGCGTTT CCCCCTCCAA GCTCCCTCGT GCGCTCTCCT GTTCCGACCC 
2751 TGCCGCTTAC CGGATACCTG TCCGCCTTTC TCCCTTCGGG AAGCGTGGCG 
2801 CTTTCTCAAT GCTCACGCTG TAGGTATCTC AGTTCGGTGT ACGTCGTTCC 
2851 CTCCAAGCTG GGCTGTGTGC ACGAACCCCC CGTTCAGCCC GACCGCTGCG 
2901 CCTTATCCGG TAACTATCGT CTTGAGTCCA ACCCGGTAAG ACACGACTTA 
2951 TCGCCACTGG CAGCAGCCAC TGGTAACAGG ATTAGCAGAG CGAGGTATCT 

FIG.6C 

SUBSTITUTE SHEET (RULE 26) 



WO 95/24-185 



PCT/1S95/02633 



9/29 

3001 AGGCGGTGCT ACAGAGTTCT TCAAGTCCIC CCCTAACTAC GGCTACACTA 
3051 GAAGGACAGT ATTTGGTATC TCCCCTCTGC IGAAGCCAGT IACCTTCGGA 
3101 AAAAGAGTTG GTAGCTCTTG ATCCGGCAAA CAAACCACCG CTGGTAGCGG 
3151 TGGTTTTTTT GTTTGCAAGC ACCACATTAC GCGCAGAAAA AAAGGATCTC 
3201 AAGAAGATCC TTTGATCTTT TCTACGGGGT CTGACGCTCA GTGGAACGAA 
3251 AACTCACGTT AAGGGATTTT GGTCATGAGA TTATCAAAAA GGATCTTCAC 
3301 CTAGATCCTT TTAAATTAAA AATGAAGTTT TAAATCAATC TAAACTATAT 
3351 ATGAGTAMC TTGGTCTGAC AGTTACCAAT GCTTAATCAG TGAGGCACCT 
3401 ATCTCAGCGA TCTGTCTATT TCGTTCATCC ATAGTTGCCT GACTCCCCGT 
3451 CGTGTAGATA ACTACGATAC GGGAGGGCTT ACCATCTGGC CCCAGTGCTG 
3501 CAATGATACC GCGAGACCCA CGCTCACCGG CTCCAGATTT ATCAGCAATA 
3551 AACCAGCCAG CCGGAACCGC CGAGCGCAGA AGTGGTCCTG CAACTTTATC 
3601 CGCCTCCATC CAGTCTATTA ATTGTTGCCG GGAAGCTAGA GTAAGTAGTT 
3651 CGCCAGTTAA TAGTTTGCGC AACGTTGTTG CCATTGCTAC AGGCATCGTG 
3701 GTGTCACGCT CGTCCTTTGC TATGGCTTCA TTCAGCTCCG GTTCCCAACG 
3751 ATCAAGGCGA GTTACATGAT CCCCCATGTT GTGCAAAAAA GCGGTTAGCT 
3801 CCTTCGGTCC TCCGATCGTT GTCAGAAGTA AGTTGCCCGC AGTGTTATCA 
3851 CTCATGGTTA TGGCAGCACT GCATAATTCT CTTACTGTCA TGCCATCCGT 
3901 AAGATGCTTT TCTGTGACTG GTGAGTACTC AACCAAGTCA TTCTGAGAAT 
3951 AG TGTATGCG GCGACCGAGT TGCTCTTGCC CGGCGTCAAT ACGGGATAAT 
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4001 ACCGCGCCAC ATAGCAGAAC TTTAAAACIC CTCATCATTG GAAAACGTTC 

4051 TTCGGGGCGA AAACTCTCAA GGATCT TACC GCTGTTGAGA TCCAGTTCGA 

4101 TGTAACCCAC TCGTGCACCC AACTGATCTT CAGCATCTTT TACTTTCACC 

4151 AGCGTTTCTG GGTGAGCAAA AACAGGAACG CAAAATGCCG CAAAAAAGGG 

4201 AATAAGGCCG ACACGGAAAT GTTGAATACT CATACTCTTC CTTTTTCAAT 

4251 ATTATTGAAG CATTTATCAG GGTTATTGTC TCATGAGCGG ATACATATTT 

4301 GAATGTATTT AGAAAAATAA ACAAATAGGG GTTCCCCGCA CATTTCCCCG 

4351 AAAAGTGCCA CCTGACGTCT AAGAAACCAT TATTATCATG ACATIAACCT 

4401 ATAAAAATAG CCCTATCACC AGGCCCTTTC GTC 
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1 TCGCCCCTTT CGGTGATGAC GGIGAAAACC TCTGACACAT GCAGCTCCCG 

51 GAGACGGTCA CAGCTTGTCT GTAAGCGGAT GCCGGGAGCA GACAAGCCCG 

101 TCAGGGCGCG TCAGCCCCTG TTCGCGGGTG TCCCCGCTGC CTTAACTATG 

151 CGGCATCAGA GCAGATTGTA CTGAGACTGC ACCATATGCG GTGTGAAATA 

201 CCGCACAGAT GCGTAAGGAG AAAATACCGC ATCAGATTGG CTATTGGCCA 

251 TTGCATACGT TGTATCCATA TCATAATATG TACATTTATA TTGGCTCATG 

301 TCCAACATTA CCCCCATGTT GACATTGATT ATTGACTAGT TATTAATAGT 

351 AATCAATTAC GGGGTCATTA GTTCATAGCC CATATATGGA GTTCCGCGTT 

401 ACATAACTTA CGGTAAATGG CCCGCCTGGC TGACCGCCCA ACGACCCCCG 

451 CCCATTGACG TCAATAATGA CGTATGTTCC CATAGTAACG CCAATAGGGA 

501 CTTTCCATTG ACGTCAATGG GTGGAGTATT TACGGTAAAC TGCCCACTTG 

551 GCAGTACATC AAGTGTATCA TATGCCAAGT ACGCCCCCTA TTGACGTCAA 

601 TGACGGTAAA TGGCCCGCCT GGCATTATGC CCAGTACATG ACCTTATGGG 

651 ACTTTCCTAC TTGGCAGTAC ATCTACGTAT TAGTCATCGC TATTACCATG 

701 GTGATGCGGT TTTGGCAGTA CATCAATGGG CGTGGATAGC GGTTTGACTC 

751 ACGGGGATTT CCAAGTCTCC ACCCCATTGA CGTCAATGGG AGTTTGTTTT 

801 GGCACCAAAA TCAACGGGAC TTTCCAAAAT GTCGTAACAA CTCCGCCCCA 

851 TTGACGCAAA TGGGCGGTAG GCGTGTACGG TGGGAGGTCT ATATAAGCAC 

901 AGCTCGTTTA GTGAACCGTC AGATCGCCTG GAGACGCCAT CCACGCTGTT 

951 TTGACCTCCA TAGAAGACAC CGGGACCGAT CCAGCCTCCG CGGCCGGGAA 
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1001 CGGIGCATTG GAACGCGGAT TCCCCCTCCC AAGAGTGACG TAAGIACCGC 
1051 CTATAGAGTC TATAGGCCCA CCCCCTTCGC TTCTTATGCA TGCIATACTG 
1101 TTITTGGCTT GGGGTCTATA CACCCCCGCT TCCTCATGTT AIAGGTCATG 
1151 GTATAGCTTA GCCTATAGGT GTGGCTTATT GACCATTATT GACCACTCCC 
1201 CTATTGGTGA CGATACTTTC CATTACTAAT CCATAACATG GCTCTTTGCC 
1251 ACAACTCTCT TTATTCCCTA TATGCCAATA CACTGTCCTT CAGAGACIGA 
1301 CACGCACTCT GTATTTTTAC AGGATGGGGT CTCATTTATT ATTIACAAAT 
1351 TCACATATAC AACACCACCG TCCCCAGTGC CCGCAGTTTT TATTAAACAT 
1401 AACGTGGGAT CTCCACGCGA ATCTCGGGTA CGTGTTCCGG ACATGGGCTC 
1451 TTCTCCGGTA CCGGCGGAGC TTCTACATCC GAGCCCTGCT CCCATGCCTC 
1501 CAGCGACTCA TGGTCGCTCG GCAGCTCCTT GCTCCTAACA GTGGAGGCCA 
1551 GACTTAGGCA CAGCACGATG CCCACCACCA CCAGTGTGCC GCACAAGGCC 
1601 GTGGCGGTAG GGTATGTGTC TGAAAATGAG CTCGGCGAGC GGGCTTGCAC 
1651 CGCTGACGCA TTTGGAAGAC TTAAGGCAGC GGCAGAAGAA GATGCAGGCA 
1701 GCTGAGTTGT TGTGTTCTGA TAAGAGTCAG AGGTAACTCC CGTTCCGGTC 
1751 CTGTTAACGG TGGAGGGCAG TGTAGTCTGA GCAGTACTCG TTGCTGCCGC 
1801 GCGCGCCACC AGACATAATA GCTGACAGAC TAACAGACTG TTCCTTTCCA 
1851 TGGGTCTTTT CTGCAGTCAC CGfCCTTAG ATCTGCTGTG CCTTCTAGTT 
1901 GCCAGCCATC TGTTGTTTGC CCCTCCCCCG TGCCTTCCTT GACCCTGGAA 
1951 GCTGCCACTC CCACTGTCCT TTCCTAATAA AATGAGGAAA TTGCATCGCA 
2001 TTGTCTGAGT AGGTGTCATT CTATTCTGGG GGGTGGGGTG GGGCAGCACA 
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2051 GCAAGGGGGA GGATTGGGAA GACAATAGCA GGCATGCTGG GGATGCGGTG 

2101 GGCTCTATGG GTACCCAGCT GCTGAAGAAT TGACCCGGTT CCTCCTGGGC 

2151 CAGAAAGAAG CAGGCACATC CCCHCTCTG TGACACACCC TGTCCACGCC 

2201 CCTGGTTCTT AGTTCCACCC CCACTCATAG GACACTCATA GCTCAGGAGG 

2251 GCTCCGCCTT CAATCCCACC CGCTAAAGTA CTTGGAGCCG TCTCTCCCTC 

2301 CCTCATCAGC CCACCAAACC AAACCTAGCC TCCAAG.AGTG GGAAGAAATT 

2351 AAAGCAAGAT AGGCTATTAA GTGCAGAGGG AGAGAAAATG CCTCCAACAT 

2401 GTGAGGAAGT AATGAGAGAA ATCATAGAAT TTCTTCCGCT TCCTCGCTCA 

2451 CTGACTCGCT GCGCTCGGTC GTTCGGCTGC GGCGAGCGGT ATCAGCTCAC 

2501 TCAAAGGCGG TAATACGGTT ATCCACAGAA TCAGGGGATA ACGCAGGAAA 

2551 GAACATGTGA GCAAAAGGCC AGCAAAAGGC CAGGAACCGT AAAAAGGCCG 

2601 CGTTGCTGGC GTTTTTCCAT AGGCTCCGCC CCCCTGACGA GCATCACAAA 

2651 AATCGACGCT CAAGTCACAG GTGGCGAAAC CCGACAGCAC IATAAAGATA 

2701 CCAGGCGTTT CCCCCTGGAA GCTCCCTCGT GCGCTCTCCT GTTCCGACCC 

2751 TGCCGCTTAC CGGATACCTG TCCGCCTTTC TCCCTTCGGG AAGCGTGGCG 

2801 CTTTCTCAAT GCTCACGCTG TAGGTATCTC AGTTCGGTGT AGGTCGTTCG 

2851 CTCCAAGCTG GGCTGTGTGC ACGAACCCCC CGTTCAGCCC GACCGCTGCG 

2901 CCTTATCCGG TAACTATCGT CTTGAGTCCA ACCCGGTAAG ACACGACTTA 

2951 TCGCCACTGG CAGCACCCAC TGGTAACAGG ATTAGCAGAG CGAGGTATGT 

3001 AGGCGGTGCT ACAGAGTTCT TGAAGTGGTG GCCTAACTAC GGCTACACTA 

3051 GAAGGACAGT ATTTGGTATC TGCGCTCTGC TGAAGCCAGT TACCTTCGGA 
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3101 AAAAGAGTTG GTAGCTCTTG ATCCGGCAAA CAAACCACCG CTGCIAGCGG 
3151 TGGTTTTTTT GTTTGCAAGC AGCAGATTAC GCGCAGAAAA AAAGGATCTC 
3201 AAGAAGATCC TTTGATCTTT TCTACGGGGT CTGACCCTCA GTGGAACGAA 
3251 AACTCACGTT AAGGGATTTT GGTCATGAGA TTATCAAAAA GGATCTTCAC 
3301 CTAGATCCTT TTAAATTAAA AATGAAGTTT TAAATCAATC TAAAGTATAT 
3351 ATGAGTAAAC 7TGGTCTCAC AGTTACCAAT GCTTAATCAG TGAGGCACCT 
3401 AICTCACCGA TCTGTCTATT TCGTTCATCC ATAGTTGCCT GACTCCGGGC 
3451 GGGGGGGGCG CTGAGGTCTG CCTCGTGAAG AAGGTGTTGC TGACTCATAC 
3501 CAGGCCTGAA TCGCCCCATC ATCCAGCCAG AAAGTGAGGG AGCCACGGTT 
3551 GATGAGAGCT TTGTTGTAGG TGGACCAGTT GGTGATTTTG AACTTTTGCT 
3601 TTGCCACGGA ACGGTCTGCG TTGTCGGGAA GATGCGTGAT CTGATCCTTC 
3651 AACTCAGCAA AAGTTCGATT TATTCAACAA AGCCGCCGTC CCGTCAAGTC 
3701 AGCGTAATGC TCTGCCAGTG TTACAACCAA TTAACCAATT CTGATTAGAA 
3751 AAACTCATCG AGCATCAAAT GAAACTGCAA TTTATTCATA TCAGGATTAT 
3801 CAATACCATA TTTTTGAAAA AGCCGTTTCT GTAATGAAGG AGAAAACTCA 
3851 CCGAGGCAGT TCCATAGGAT GGCAAGATCC TGGTATCGGT CTGCGATTCC 
3901 GACTCCTCCA ACATCAATAC AACCTATTAA TTTCCCCTCG TCAAAAATAA 
3951 GGTTATCAAG TGAGAAATCA CCATGAGTGA CGACTGAATC CGGTGAGAAT 
4001 GGCAAAAGCT TATGCATTTC TTTCCAGACT TGTTCAACAG GCCAGCCATT 
4051 ACGCTCGTCA TCAAAATCAC TCGCATCAAC CAAACCGTTA TTCATTCCTG 
4101 ATTGCGCCTG AGCGAGACGA AATACGCGAT CGCTGTTAAA AGGACAATTA 
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4151 CAAACAGGAA ICGAATGCAA CCGGCGCAGG AACACTGCCA GCGCATCAAC 

4201 AATATTTTCA CCTGAATCAG GATATTCTTC TAAIACCTGG AATGCTGTTT 

4251 TCCCGGGGAT CGCAGTGGTG AGTAACCATG CATCATCAGG AGTACGGATA 

4301 AAATGCTTGA TGGTCGGAAG AGGCATAAAT TCCGTCAGCC AGTTTAGTCT 

4351 GACCATCTCA TCTGTAACAT CATTGGCAAC GCTACCTTTG CCATGTTTCA 

4401 GAAACAACTC TGCCCCATCC GGCTTCCCAT ACAATCGATA GATTGTCGCA 

4451 CCTGATTGCC CGACATTATC GCGAGCCCAT TTATACCCAT ATAAATCAGC 

4501 ATCCATGTTG GAATTTAATC GCGGCCTCGA GCAAGACGTT TCCCGTTGAA 

4551 TATGGCTCAT AACACCCCTT GTATTACTGT TTATGTAAGC AGACAGTTTT 

4601 ATTGTTCATG ATGATATATT TTTATCTTGT GCAATGTAAC ATCAGACATT 

4651 TTGAGACACA ACGTGGCTTT CCCCCCCCCC CCATTATTGA AGCATTTATC 

4701 AGGGTTATTG TCTCATGAGC GGATACATAT TTGAATGTAT TTAGAAAAAT 

4751 AAACAAATAG GGGTTCCGCG CACATTTCCC CGAAAACTCC CACCTGACGT 

4801 CTAAGAAACC ATTATTATCA TGACATTAAC CTAfAAAAAT AGGCGTATCA 

4851 CGAGGCCCTT TCGTC 
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1 ATTGGCTATT GGCCATTGCA TACGTTGTAT CCATATCATA AIATGTACAT 
51 TTATATIGGC TCATGTCCAA CATTACCGCC ATGTTGACAT TGATTATTGA 
101 CTAGTTATTA ATAGTAATCA ATTACGGGGT CATTAGTTCA TAGCCCATAT 
151 ATGGAGTTCC GCGTTACATA ACTTACGGTA AATGGCCCGC CIGCCTGACC 

201 GCCCAACGAC CCCCGCCCAT TGACGTCAAT AATGACGTAT GTTCCCATAG 
251 TAACGCCAAT AGGGACTTTC CATTGACGTC AATGGGTGGA GTATTTACGG 
301 TAAACTGCCC ACTTGGCAGT ACATCAAGTG TATCATAICC CAAGTACGCC 

351 CCCTATTGAC GTCAATGACG GTAAATGGCC CGCCTGGCAT TATGCCCAGT 

401 ACATGACCTT ATGGGACTTT CCTACTTGGC AGTACATCTA CGTATTAGTC 

451 ATCGCTATTA CCATGGTGAT GCGGTTTTGG CAGTACAICA ATGGGCGTGG 

501 ATAGCGGTTT GACTCACGGG GATTTCCAAG TCTCCACCCC ATTGACGTCA 

551 ATGGGACTTT GTTTTGGCAC CAAAATCAAC GGGACTTTCC AAAATGTCGT 

601 AACAACTCCG CCCCATTGAC GCAAATGGGC GGTAGGCGTG TACGGTGGGA 

651 GGTCTATATA AGCAGAGCTC GTTTAGTGAA CCGTCAGATC GCCTGGAGAC 

701 GCCATCCACG CTGTTTTGAC CTCCATAGAA GACACCGGGA CCGATCCAGC 

751 CTCCGCGGCC GGGAACGGTG CATTGGAACG CGGATTCCCC GTGCCAAGAG 

801 TGACGTAACT ACCGCCTATA GAGTCTATAG GCCCACCCCC TTGGCTTCTT 

851 ATGCATGCTA TACTGTTTTT GGCTTGGGGT CTATACACCC CCGCTTCCTC 

901 ATGTTATAGG TGATGGTATA GCTTAGCCTA TAGGTGTGGG TTATTGACCA 

951 TTATTGACCA CTCCCCTATT GGTGACGATA CTTTCCATTA CTAATCCATA 

■ 1001 ACATGGCTCT TTGCCACAAC TCTCTTTATT GGCTATATGC CAATACACTG 
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1051 TCCTTCAGAG ACTCACACGG ACTCTGTAIT TTTACAGGAT GGGCTCTCAT 
HOT TTATTATTIA CAAATTCACA TA1ACAACAC CACCGTCCCC ACTCCCCGCA 
1151 GTTTTTATTA AACATAACGT GGGATCTCCA CGCGAATCTC GGGTACGTGT 
1201 TCCGGACATG GGCTCITCTC CCGTACCGGC GGAGCTTCTA CATCCCAGCC 
1251 CTGCTCCCAT CCCrCCACCG ACTCATGGTC GCTCGGCAGC TCCTTGCTCC 
1301 TAACAGTGGA GGCCAGACTT AGCCACACCA CGATGCCCAC CACCACCAGT 
1351 GTCCCCCACA AGGCCGTGGC GGTAGGGTAT GTGTCTGAAA ATGACCTCCG 
1401 GGAGCGGGCT TGCACCGCTG ACGCATTTGG AAGACTTAAG GCAGCGGCAG 
1451 AAGAAGATGC AGGCACCTGA GTTGTTGTGT TCTGATAAGA GTCAGAGGTA 
1501 ACTCCCGTTG CGGTGCTGTT AACGGTGGAG GGCAGTGTAG TCTGAGCAGT 
1551 ACTCGTTGCT GCCGCGCGCG CCACCAGACA TAATAGCTGA CAGACTAACA 
1601 GACTGTTCCT TTCCATGGGT CTTTTCTGCA GTCACCGTCC TTAGATCTG 
1651 CTGTGCCTTC TAGTTGCCAG CCATCTGTTG TTTGCCCCTC CCCCGTGCCT 
1701 TCCTTGACCC TGGAAGGTGC CACTCCCACT GTCCTTTCCT AATAAAATGA 
1751 GGAAATTGCA TCGCATTGTC TGAGTAGGTG TCATTCTATT CTGGGGGGTG 
1801 GGGTGGGGCA GCACAGCAAG GGGGAGGATT GGGAAGACAA TAGCACGCAT 
1851 GCTGGGGATG CGGTGGGCTC TATGGGTACC CAGGTGCTGA AGAATTGACC 
1901 CGGTTCCTCC TGGGCCAGAA AGAACCACGC ACATCCCCTT CTCTGTGACA 
1951 CACCCTGTCC ACGCCCCTGG TTCTTAGTTC CAGCCCCACT CATAGGACAC 
2001 TCATAGCTCA GGAGGGCTCC CCCTTCAATC CCACCCGCTA AAGTACTTGG 
2051 AGCGCTCTCT CCCTCCCTCA TCAGCCCACC AAACCAAACC TAGCCTCCAA 
2101 GAGTGGGAAG AAATTAAAGC AAGATAGGCT ATTAAGTGCA GAGGGAGAGA 
2151 AAATGCCTCC AACATGTGAG GAACTAATGA GAGAAATCAT AGAATTC 
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1 GAIATIGG CTATTGCCCA 

251 TTGCATACGT TGTATCCATA TCAIAATATG TACAFTTATA TTGCCTCATC 

301 ICCAACAITA CCGCCATGTT GACATTGATT ATTGACTAGT TATTAAIAGT 

351 AATCAATIAC GGGGTCATTA GTTCATAGCC CATATATGGA CTTCCGCGTI 

401 ACATAACTTA CGGTAAATGG CCCGCCTGGC TGACCGCCCA ACGACCCCCG 

451 CCCATTGACG TCAATAATGA CGTATGTTCC CATAGTAACG CCAATAGGGA 

501 CTTTCCATTG ACGTCAATCC GTGGAGTATT TACGGTAAAC TGCCCACTTC 

551 GCAGTACATC AAGTGTATCA TATGCCAAGT ACGCCCCCTA TTGACGTCAA 

601 TGACGGTAAA TGGCCCGCCT GGCATTATGC CCAGTACATG ACCTTATGGG 

651 ACTTTCCTAC TTGGCAGTAC ATCTACGTAT TAGTCATCGC TATTACCATG 

701 GTGATCCGGT TTTGGCAGTA CATCAATGGG CGTGGATAGC GGTTTGACTC 

751 ACGGGGATTT CCAAGTCTCC ACCCCATTGA CGTCAATGGG AGTTTGTTTT 

801 GGCACCAAAA TCAACGGGAC TTTCCAAAAT GTCGTAACAA CTCCGCCCCA 

851 TTGACGCAAA TGGGCGGTAG GCGTGTACGG TGGGAGGTCT ATATAAGCAG 

901 AGCTCGTTTA GTGAACCGTC AGATCGCCTG GACACGCCAT CCACGCTGTT 

951 TTGACCTCCA TAGAAGACAC CGGGACCCAT CCAGCCTCCG CGGCCGGGAA 

1001 CGGTGCATTG GAACGCGGAT TCCCCGTGCC AAGAGTGACG TAAGTACCGC 

1051 CTATAGAGTC TATAGGCCCA CCCCCTTGGC TTCTTATGCA TGCTATACTG 

1101 TTTTTGGCTT GGGGTCTATA CACCCCCGCT TCCTCATGTT ATAGGTGATG 

1151 GTATAGCTTA GCCTATAGGT GTGGGTTATT GACCATTATT GACCACTCCC 

1201 CTATTGGTGA CGATACTTTC CATTACTAAT CCATAACATC GCTCTTIGCC 
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1251 ACAACTCTCT TTAUGGCTA TATGCCAATA CACTCTCCTT CAGAGACTGA 
1301 CACGGACTCT GTATTTTTAC AGGATGGGGT CTCATTTATT ATITACAAAT 
1351 TCACAIATAC AACACCACCC TCCCCAGTGC CCGCAGTITT TATTAAACAT 
1401 AACGTGGGAT CTCCACGCGA ATCTCGGGTA CGTGTTCCGG ACATGGGCTC 
1451 TTCTCCGGTA GCGGCGGAGC TTCTACATCC GAGCCCTGCT CCCATGCCTC 
1501 CAGCGACTCA TGGTCGCTCG GCAGCTCCTT GCTCCTAACA GTGGAGCCCA 
1551 GACTTAGGCA CAGCACGATG CCCACCACCA CCAGTGTGCC GCACAAGGCC 
1601 GTGGCGGTAG GGTATGTGTC TGAAAATGAG CTCCCGCACC GGGCTTGCAC 
1651 CGCTGACGCA TTTGGAAGAC TTAAGGCAGC GGCAGAAGAA GATGCAGGCA 
1701 GCTGAGTTGT TGTGTTCTGA TAAGAGTCAG AGGTAACTCC CGTTGCGGTG 
1751 CTGTTAACGC TGGAGGGCAG TGTAGTCTGA GCAGTACTCG TTGCTGCCGC 
1801 GCGCGCCACC AGACATAATA GCTGACAGAC TAACAGACTG TTCCTTTCCA 
1851 TGGGTCTTTT CTGCAGTCAC CGTCCTTAG ATCTGCTGTG CCirCTAGTT 
1901 GCCAGCCATC TGTTGTTTGC CCCTCCCCCG TGCCTTCCTT GACCCTGGAA 
1951 GGTGCCACTC CCACTGTCCT TTCCTAATAA AATGAGGAAA TTGCATCGCA 
2001 TTGTCTGAGT AGGTGTCATT CTATTCTGGG GGGTGGGGTG GGGCAGCACA 
2051 GCAAGGGGGA GGATTGGGAA GACAATAGCA GGCATGCTGG GCATCCGGTG 
2101 GGCTCTATGG GTAC GGCCGCAGCGGCC GTACCCAGGT GCTGAAGAAT 
TGACCCGGTT CCTCGACCCGT AAAAAGGCCG 

2601 CGTTGCTGGC GTTTTTCCAT AGGCTCCGCC CCCCTGACGA CCATCACAAA 
2651 AATCGACGCT CAAGTCAGAG GTGGCGAAAC CCGACAGGAC TATAAAGATA 
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3001 
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PJTTAPArTA 


3051 


GAAGGACAGT ATTTGGTATC TGCGCTCTGC TGAAGCCAGT 


TAPPTTPfiGA 


3101 


AAAAGAGTTG GTAGCTCTTG ATCCGGCAAA CAAACCACCG 


PTPPTAPPPP 
b 1 Ob 1 nbbbb 


3151 


TGGTTTTTTT GTTTGCAAGC AGCAGATTAC GCGCAGAAAA 


AAAPPATPTP 
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3201 


AAGAAGATCC TTTGATCTTT TCTACGTGATCC CGTAATGC 


TPTPPPAPTP 
1 b 1 bbbAb 1 b 


TTACAACCAA TTAACCAATT CTGATTAGAA 




3751 


AAACTCATCG AGCATCAAAT GAAACTGCAA TTTATTCATA 


IbAbbAt IAI 


3801 


CAATACCATA TTTTTGAAAA AGCCGTTTCT GTAATGAAGG 


APAA AAPTPA 
AbAAAAb IbA 


3851 
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PTPPPATTPP 
b 1 bbbn 1 1 LL 


3901 


GACTCGTCCA ACATCAATAC AACCTATTAA TTTCCCCTCG 


TPAAAAATAA 


3951 


GGTTATCAAG TGAGAAATCA CCATGAGTGA CGACTGAATC 


CGGTGAGAAT 


4001 


GGCAAAAGCT TATGCATTTC TTTCCACACT TGTTCAACAC 


GCCAGCCATT 


4051 


ACGCTCGTCA TCAAAATCAC TCGCATCAAC CAAACCGTTA 


TTCATTCGTG 


4101 


ATTGCGCCtG AGCGAGACGA AATACGCGAT CGCTGTTAAA 


AGGACAATTA 


4151 


CAAACAGGAA TCGAATGCAA CCGGCGCAGG AACACTGCCA 


GCGCATCAAC 
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4201 AATATTTTCA CCTGAATCAG GATATTCTTC TAATACCTGG AATGCTGTTT 

4251 TCCCGGGGAT CGCAGTGGTG AGTAACCATG CATCATCAGG AGTACGGATA 

4301 AAATGCTTGA TGGTCGGAAG AGGCATAAAT TCCGTCAGCC AGTTTAGTCT 

4351 GACCATCTCA TCTGTAACAT CATTGGCAAC GCTACCTTTG CCATGTTTCA 

4401 GAAACAACTC TGCCGCATCC GGCTTCCCAT ACAATCGATA GATTGTCGCA 

4451 CCTGATTGCC CGACATTATC GCGAGCCCAT TTATACCCAT ATAAATCAGC 

4501 ATCCATGTTG GAATTTAATC GCGGCCTCGA GCAAGACGTT TCCCGTTGAA 

4551 TATGGCTCAT AACACCCCTT GTATTACTGT TTATGTAAGC AGACAGTTTT 

4601 ATTGTTCATG ATGATATATT TTTATCTTGT GCAATGTAAC ATCAGAGATT 

4651 TTGAGACACA ACGTGGCTTT CC 
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